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ABSTRACT 


This is an analysis of strategic voting under the plurality rule. Existing theories predict strict 
bipartism, where rational voters support only two candidates: a strict interpretation of Du- 
verger’s Law. This conclusion is rejected. The new theory employs a simple model of a three- 
candidate election. Unlike previous models, the level of popular support for each candidate 
is not commonly certain. Voters form opinions of candidate support from public and private 
signals. Unless knowledge of constituency-wide factors is both common and precise, there is a 
uniquely stable non-Duvergian equilibrium, with only partial strategic voting. Strategic voting 
is increasing in the precision of voter beliefs, the expected strength of a leading candidate, and 
the expected gap between challenging candidates. The effect of information depends critically 
on its source. Public signals of candidate support have a far stronger effect than privately 
observed signals. Surprisingly, when voting decisions are based largely on private rather than 


public information, strategic voting is self-attenuating rather than self-reinforcing. 


Acknowledgements: Stephen D. Fisher inspired this work with his extensive empirical re- 
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EXTENDED ABSTRACT 


A New Theory of Strategic Voting describes a new theory of strategic voting in plurality 
electoral systems. Existing theories are shown to rest on the assumed independence of voter 
preferences. Replacing this assumption with one of uncertain candidate support, the new 
theory models both the public and private information sources that enter into a voter’s decision. 
Suprisingly, with pure private information sources, strategic voting is self-attenuating and not 
self-reinforcing. This leads to a rejection of strict bipartism, and a new range of comparative 
static results. Crucially, and contrary to existing theories, the model shows that multi-candidate 
support in plurality elections is perfectly consistent with rational voting. The theory is presented 


in a sequence of three parts. 


Part 1 provides a critique of existing models. Following earlier work, it observes that the key 
criterion for a strategic voter is the relative likelihood of pivotal ties between the competing 
candidates. Existing models impose independence on the preferences of voters, yielding inde- 
pendence of voting decisions. Under this scenario, the relative likelihood of a tie between the 
two leading candidates in an election becomes infinitely greater than a tie between any other 
pair for large electorates, yielding a strict bipartite outcome with rational voters. Following the 
introduction of uncertainty over constituency-wide support, this result fails. It follows that in a 
decision-theoretic framework, a rational electorate gives support to all candidates. Crucially, it 
only the constituency uncertaint that matters in determining voting behaviour. Intuitively, any 


idiosyncratic uncertainty on the part of individual voters is averaged out in a large electorate. 


Part 2 moves to a game theoretic framework. The model builds a microfoundation for con- 
stituency uncertainty in an election, by allowing for unknown constituency-wide common ef- 
fects. Beliefs over such effects (and hence candidate support) are elicited from observed signals, 
corresponding to samples of preferences from the electorate. To separate the effects of certain 
support and common knowledge of expected candidate standings, the analysis restricts to public 
signals. Strategic voting is self-reinforcing, and with sufficient information precision, a bipartite 
outcome emerges. Importantly, the analysis shows that it is information precision, rather than 
large electorates, that can lead to bipartism. This precision can be indexed by the size of an 


opinion poll: Allowing the opinion poll sample size to expand leads to bipartism. 


Part 3 overturns this result. It argues that, at a constituency level, information sources are 
likely to be privately observed. Specifying a model in which each voter observes a private 


signal of candidate support, it arrives at the surprising conclusion that strategic voting is self- 
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attenuating rather than self-reinforcing. Anticipation of strategic voting by others dampens 
strategic incentives, hence reducing the equilibrium level of strategic switching. It follows 
that strategic voting is once again incomplete, and multi-candidate support emerges. With 
a combination of public and private information sources, a bipartite outcome requires public 


signals to be sufficiently precise relative to private signals. 


Parts 1-3 display common comparative statics. Strategic voting is increasing in the precision 
of information, the homogeneity of the electorate, and the perceived gap between challenging 
candidates. Contrary to earlier informal analyses, there is little strategic voting in a close 
election. The theory shows that the lack of common knowledge of candidate standing can 


prevent the coordination of challenging candidates in the defeat of a disliked third party. 
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PART 1 


DECONSTRUCTING DUVERGER’S LAW 


“The whole problem with the world is that fools and fanatics are 


always so certain of themselves, but wiser people so full of doubts.” 


Bertrand Russell 


1.1 INTRODUCTION 


1.1.1 Duverger’s Law 


The tendency of plurality voting to yield a two-party system is a long-held tenet. In 1951, 
Maurice Duverger elevated this hypothesis to legal status. In an oft-quoted passage, he stated 
that the hypothesis “approaches most nearly perhaps to a true sociological law” (Duverger 1954, 
p. 217). This daring statement is, however, accompanied by qualifying language. As Riker 
(1982, p. 754) notes, Duverger’s hypotheses display some ambiguity, perhaps best exhibited by 
the statement of Duverger’s Law itself as “the simple-majority single-ballot system favors the 
two-party system” (Duverger 1954, p. 217, emphasis added). This stops short of claiming that 


a plurality system is sufficient for strict bipartism. 


The formulation of Duverger’s Law encompasses strategic interaction between both political 
parties and voters: The electorate responds to the stance of political parties via vote choice, and 
candidates act with reference to both past and future voting decisions. It is clear, however, that 
the influence of the voting mechanism must stem from its effect on voting behaviour.! Analysis, 
therefore, must include investigation of voting under the plurality rule, and the associated 


district level bipartism. Consideration of strategic voting and the wasted vote phenomenon 


‘Note that this may be anticipated rather than actual voter behaviour. There may only be two political 
parties, since an entrant anticipates performing poorly under the electoral system. It is still the effect of the 
voting mechanism on voter behaviour that drives this — even though it is a hypothesised election rather than 
an actual one. 
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follows — instrumentally rational voters may choose to vote for someone other than their 
preferred candidate. As Droop (1871) and many followers have observed, a voter may notice 
that their preferred candidate has little chance of winning the election. Mindful of “wasting 
their vote”, an individual may switch their vote to one of the leading contenders. Such strategic 


behaviour leads to erosion of support for lower placed candidates, favouring the leading pair. 


Formal analyses of strategic voting have led to a rather draconian amendment to Duverger’s 
Law. These models are surveyed by Gary Cox (1997), and claim that, under the plurality rule, 
only two candidates receive votes from an instrumentally rational electorate. Palfrey (1989) is 
a case in point — his paper presents a “mathematical proof” to enforce this strict version of 
Duverger’s Law: 

“ |.. with instrumentally rational voters and fulfilled expectations, multicandidate 


contests under plurality rule should result in only two candidates getting any votes.” 


Palfrey examines the stable Bayesian Nash equilibria of his model, and finds that the support 
for any third candidate vanishes completely as the electorate grows large. In contrast to this 
prediction many plurality voting systems, including those of the UK and India, exhibit multi- 
candidate support.2 Cox (1997) and others rightly recognise that their voting models are 
stylised abstractions, and hence one should not expect to observe their exact conclusions. In 
the light of Palfrey’s claim, it is tempting to attribute the absence of strict bipartite election 


outcomes to the lack of rationality of voters, or to miscoordination on their part. 


1.1.2 A New Theory of Strategic Voting 


The new theory rejects this assertion. It observes that the assumed independence of voter 
preferences drives existing models. In response, it develops a new model in which voters are 
uncertain of constituency wide-preferences, and carefully models both public and private infor- 
mation sources upon which individuals condition their voting decisions. Arguing that privately- 
observed information sources are of key importance at a district level, the analysis shows that 
in the private information case strategic voting is self-attenuating rather than self-reinforcing. 
This leads to non-Duvergerian conclusions, with multi-candidate support, and new comparative 
statics. Summarising, the observation of support for multiple candidates in plurality elections 


is perfectly consistent with rational behaviour on the part of individual voters. 


The formulation of the theory begins here. This part analyses the decision-theoretic case, and 


explores the implications of weakening the independence of voter preferences. Part 2 presents 


? Analysis of strategic voting in Britain is provided by Johnston and Pattie (1991), Lanoue and Bowler (1992) 
and Niemi, Whitten and Franklin (1992) inter alia. For an analysis of the Indian case, see Riker (1976). He 
hypothesises that reduced strategic voting is due to the presence of a clear Condorcet winner, against which 
strategic voting is futile. This is not supported by the analysis of the new theory. A full application to this case 
is left to future work. 
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a fully game-theoretic model, in which information pertaining to constituency-wide preferences 
is obtained from a public source. It demonstrates that sufficient precision in public signals is 
required in order to obtain a stable Duvergerian outcome. Finally, Part 3 extends the model to 
include private signals. The analysis shows that strategic voting may exhibit negative feedback, 


and reaches a non-Duvergerian conclusion. 


1.1.3 Voting and Preference Independence 


The argument begins here in Part 1 by noting that a key assumption drives Duvergerian results 
— the authors cited here all specify that the preferences of voters are drawn independently from 
a commonly known distribution. Given a symmetric Bayesian strategy profile, independence 
of preferences leads to the independence of voting decisions. In particular, the leading pair 
of candidates in an plurality election is then commonly known. Cox (1997, p. 78) himself 


recognises this fact, noting that: 


“(A condition] necessary to generate pure local bipartism is that the identity of 


trailing and front-running candidates is common knowledge.” 


He hypothesises that failure of this condition is akin to miscoordination in the Battle of the 
Sexes. A rather more subtle mechanism is at work, however. To see this, and following Hoff- 
man (1982), Palfrey (1989), Myerson and Weber (1993) and others, note that in a plurality 
mechanism it is not the voting probabilities per se that are of interest to a voter. Indeed, a 
single individual can only influence the outcome of an election in the event of a tie for the lead. 
A casting vote may then determine the winning candidate. Of interest, therefore, is the relative 
likelihood of ties between different pairs of candidates. With independent voting probabilities, 
the “pivotal” probability of a tie between the two leading candidates becomes infinitely greater 
than a tie between any other pair, as the electorate grows large. It follows that a rational voter 


chooses one of the two leading candidates, and a Duvergerian outcome obtains. 


If preference realisations are not independent and voting strategies are non-degenerate, then 
this result fails. The likelihood ratio of a tie between the two leading candidates versus a tie 
between any other pair converges to a finite value as the electorate grows large. Any voter with 
a sufficiently strong preference for a win by a third candidate will not vote strategically — a 


non-Duvergerian outcome. 


The analysis explores this observation in more detail. A simple model of a three candidate 
election is constructed. The setting will be familiar to readers of Fey (1997). There are two 
types of voter. The first prefer the first candidate. The second type prefer both of the remaining 
candidates to the first. The model focuses on strategic switching between the latter candidates. 
Importantly, and in contrast to earlier analyses, the constituency-wide support for these latter 
candidates is uncertain. Constructing the optimal voting rule, the key statistic is the pivotal log- 


likelihood ratio of a tie between each of the two challenging candidates and the first candidate. 
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As the constituency size grows large, this converges to a finite value, and hence there is a finite 
incentive for a decision-theoretic strategic voter to switch candidates. It follows that strategic 


voting is incomplete. 


The critical observation is that it is only the uncertainty over constituency-wide support that 
drives strategic voting. To see this, notice that uncertainty over individual preferences and 
hence decisions is a necessary component of strategic voting. If the outcome of the election is 
certain, then (except in the case of a tie) a single vote cannot influence the outcome. Uncertainty 
over preferences (and hence the election outcome) results in the possibility (however remote) of 
a tied outcome, and hence a role for an individual voter. Consider the possible sources of such 
uncertainty. First, fixing the distribution of voter preferences, the decision of each individual 
is uncertain, since each voter is a single draw from such a distribution. This is idiosyncratic 
uncertainty. Secondly, a typical voter may be uncertain of the constituency-wide distribution 
from which preferences are drawn. This is constituency uncertainty. The Cox-Palfrey model 
assumes that the preference distribution is known. It follows that their results are entirely driven 
by idiosyncratic uncertainty. By contrast, when both sources of uncertainty are present, it is 
only constituency uncertainty that matters. The intuition is clear. In a large constituency, the 
idiosyncrasies of individuals are averaged out. Constituency uncertainty, however, is common 
to all and hence not averaged out by the Law of Large Numbers. It follows that the Cox-Palfrey 


framework is driven by the wrong factors. 


Many of the ideas offered both here and in the subsequent parts have been explored by previous 
authors. As Riker notes, analysis of the “wasted vote” phenomenon dates back to Droop 
(1871), with more recent interest stemming from Duverger (1954) and Farquharson (1969). 
The main antecedents to this work, however, are the McKelvey-Ordeshook (1972) decision- 
theoretic and Cox (1994), Palfrey (1989) and Myerson-Weber (1993) game-theoretic models of 
strategic voting. Cox anticipates that the strict bipartite outcomes are likely to fail with a 
“lack of public information about voter preferences and vote intentions”, and the new theory 


provides a formalisation of this prediction. 


1.1.4 Guide 


The argument is formalised in the following sections. A simple model of a three-candidate 
election is constructed in Section 1.2. All individuals save one are assumed to vote straightfor- 
wardly. Section 1.3 considers the optimal voting decision of the lone strategic voter, and elicits 
the key réle of the pivotal log-likelihood ratio. The behaviour of this statistic is illustrated 
in Section 1.4 with a motivating example. The main contribution of this part is in Section 
1.5, with an analysis of the asymptotic behaviour of this statistic. The limiting behaviour is 
illustrated in Section 1.6. Section 1.7 concludes the preliminary argument, as a precursor to 


the game-theoretic extensions of Parts 2 and 3. 
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1.2. MODEL 


1.2.1 The Election 


There are three candidates in a single-seat district election, indexed by j € {1,2,3}. Policy 
platforms have been set, and hence voting will take place without any candidate action. There 
are N +1 members of the electorate, indexed by i € {0,1,...,N}, where N = (1+ 1)n. 
Voter 0 is a focal voter, and will consider the behaviour of the remaining N voters to assess 
the likelihood of the various electoral outcomes. Realised vote totals for the N voters i > 0, for 
each of the three parties will be denoted {a;}_,, so that yy x; = N =(1+)n — hence 
each individual casts a single vote. The candidate receiving the largest number of votes wins 


the election — a plurality mechanism. 


1.2.2 Voter Behaviour 


The argument will focus on tactical vote switches between two candidates. Envision a situation 
in which voters prefer either the first candidate, or alternatively both of the remaining candi- 
dates over the first candidate. With this in mind, it is assumed that a fixed fraction 7/(1+71) 
of the N electors 7 > 0 will vote for candidate 1, so that 2, = yn. To obtain a model of 


interest, the following assumption is upheld. 


Assumption 1. The fraction y1 satisfies 1/2<y1 <1. 


Notice that 7, > 1 yields a certain win for candidate 1. 7, > 1/2 ensures that any close result 
must be between candidate 1 and candidate j € {2,3}. Throughout the analysis, it is implicitly 
assumed that 7,7 is integer valued. This is without loss of generality, since all results continue 


to hold when x = [yn] or 21 = [yn]. 


The model is decision-theoretic, and hence the N voters 7 > 0 act straightforwardly, and vote 
for their preferred candidate. An individual 7 receives a payoff uj; if candidate 7 wins the 
election. It follows that x1 = yn individuals satisfy uy > max{uj2, wiz}. The remaining n 
individuals (and also the focal individual i = 0) satisfy min{wj2, wiz} > uj — candidate 1 is 
their least preferred option. It follows that x2 + 73 = n. The focal voter 7 = 0 is short-term 
instrumentally rational: This voter’s decision is determined solely by its anticipated influence 
on the outcome of the election. It follows that such a voter must support either candidate 2 or 
3. The model is designed to focus on strategic vote switches between these two options. With 


this in mind, the following notation is employed. 


Definition 1. Henceforth, all references to “a voter”, “voters” or the “electorate” will refer to 


then +1 supporting candidates 2 and 3. 


Since x; = 71”, it follows that a fraction 7, of these n voters must successfully coordinate on 


either candidate 2 or 3 in order to defeat candidate 1. Voting is thus a coordination game 
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between the n+ 1 individuals. Whereas this chapter will consider only the optimal response of 
the focal voter 7 = 0, Chapters 2 and 3 take a fully game-theoretic stance and allow strategic 


behaviour throughout the electorate. 


It remains to specify the distribution of preferences within the n + 1 individuals satisfying 


min{w2, u3} > wi. Denote by p the probability that a voter prefers candidate 2, so that: 
p = Priu2 > w3| min{uj, w3} > uaa] 


Fixing p, the model thus far is equivalent to those of previous authors, including Cox (1994) 
and Fey (1997). In particular, voter preferences (and hence truthful voting decisions) are 
independent. It follows that the expected vote share of a candidate is known. Applying the 
Law of Large Numbers, the realised vote shares converge in probability to the expected vote 
shares for large constituencies. The independence assumption is removed by allowing p to be 


uncertain. 


Assumption 2. The focal voter i = 0 is uncertain of p. Conditional on any information 


available, p takes values in (0,1) with continuous density f(p) > 0, where fir = sup, f(p) < oo. 


Uncertainty over p leads to interdependence of voting decisions. The decision of one individual 
yields an updated posterior over p, and hence changes the probability of subsequent decisions. 


This is made clear in the example below. 


1.2.3 Parametric Example — The Beta-Binomial 


The Beta distribution yields a convenient conjugate prior for the binomial distribution — see, 
for instance, Casella and Berger (1990). The Beta has density: 


f(p) = aren SEAN 1 p)3- 1 
P(G2)T( 


This yields 72. = E[p] = 82/(G2 + (3). For convenience, fix 82 = m7. and $3 = m(1— 7.2). This 
fixes the mean at 72, and the parameter m may be used to vary the precision of beliefs. The 
associated density becomes: 

Tm) 


f= taaprmiceyP pron (1.1) 


To check the precision interpretation of m, calculate: 


1 _ (G+ hP(Go+ B+) __ m+ 
vary] B23 Voll 42) 


The distribution is illustrated for y2 = 32/62 and a variety of m in Figure 1.1. As the precision m 


increases, the distribution concentrates all mass around p = 72, and hence m — oo corresponds 
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— = 
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 


p= Priu2 > ug | min{ue, us} > ur] 
Figure 1.1: Beta Prior for Various m 


to the case of certain constituency-wide support. The particular parametric choice of 72 = 32/62 


corresponds to the motivating example of Section 1.4. 


1.3. VOTING BEHAVIOUR 


A vote can only influence the outcome of an election if it is pivotal. A pivotal situation arises if 
there is a tie for the lead. If this occurs, a single casting vote may swing the outcome in favour 
of one of the candidates. More generally, a voter may also influence the outcome if the votes 
cast for the leading candidates differ by a single vote. The voter may then ensure the win of 
one candidate, or alternatively force a draw. For this latter case, the tie-break rule adopted in 


the election comes into play. 


The characteristics of the optimal voting rule are familiar. Recall that Assumption 1 stipulated 
y > 1/2, so that x, = 7,n > N/3. It is clear that any tie for the lead must involve candidate 1. 
The possibilities are thus x; = %) > #3 and x; = 73 > x3. Near ties, for instance 71 = r + 1, 
are also relevant. Denote the “pivotal” probabilities, conditional on the information of the focal 


voter 7 = 0 by q2 and qs respectively: 
qj = Pr[yin -—1 <a; <n] for 7 € {2,3} 


These are the pivotal probabilities of candidates 2 and 3 lying in tied (or near-tied) contention 
for the lead with candidate 1. 


Lemma 1. A typical individual votes for party 2 if and only if qo(u2 — ur) > ga(ug — ur). 
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If there is a tie between candidates 1 and 2, the voter can shift the result from candidate 2, 
so gaining u2 — u,. The voter loses the opportunity to make the casting vote in a tie between 
candidates 1 and 3, sacrificing uz — u;. The voting rule trivially follows. This sketch neglects 


the outcomes from a dead tie. A fuller proof follows. 


Proof of Lemma 1: Recall that the restriction is to voters i satisfying min{wj2, wiz} > wa. A 
voter will never find it optimal to vote for candidate 1, and hence will choose to vote for one of 
candidates j € {2,3}. If x; > yin, then candidate j will win without the support or otherwise 
of voter 7, and hence the vote has no influence. Similarly, if max{x;} < yn —1, the additional 
vote does not influence the election. If, however, for some j € {2,3} there is a tie x; = yn, 
then an additional vote for candidate j will yield a win. Similarly, if x; = yn — 1, then an 
additional vote for candidate j will force a tie. Impose the assumption that that ties are broken 
at random, corresponding to British electoral convention. Then the potential outcomes and 


payofts from voting for candidates j € {2,3} are as follows: 


Event u(Vote 2) | u(Vote 3) | u(Vote 2) — u(Vote 3) 


UWi2tUir 
2 


wae 
rQ=yn—-1 Uil a ae | 


Ujg ris Uig— Ui 
2 


m2=71N Ui2 


r= yn-1 Usd 


Uj3g rit — Ui3g—UiL 


= irae ae ; _ Wig Us 
w= yin > ee Wi3 aoe 


Assembling these payofts, the optimal rule is clearly: 


Vote 2 < Priyn> a> un-1~ 5 “1 > Prlyin > 2g > n- 1 5 = 


Recalling the definition of q;, this is the desired result. O 


A further implicit assumption of Lemma 1 is that min{q;} > 0. Such a condition rules out 
Duvergerian outcomes, where all n voters support only one of the candidates j € {2,3}. In 
this case, unless 7, = 1, the tie probabilities are exactly zero. Within such a scenario, a voter 
is exactly indifferent between the two options — for positive q; this is (generically) never the 
case. In the decision-theoretic analysis of this part, the focal voter acts on the assumption of 
straightforward voting by the remainder of the electorate, and this case will never arise. The 
game-theoretic analysis of Parts 2 and 3 must address this issue, however, when considering 


the possibility of Duvergerian equilibria. For positive q;, the following notation is introduced. 


Definition 2. The (perceived) log-likelihood ratio of pivotal probabilities is X = log(q2/q3). 


Using this notation and Lemma 1, the optimal voting rule becomes: 


eyes 
Vote2 = log —> 4 
Ui3Z — Ui 


3An assumption implicit to Lemma 1 is that an indifferent voter casts their vote for candidate 2. This is 
made without loss of generality. 
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Hence the key statistic of interest to a voter is the log-likelihood ratio A. The tradeoff is clear. A 
strategic voter balances the relative likelihood of each candidate 7 contending for the lead against 
the relative preference for a win by each of the two candidates. Notice also that provides an 
exact measure of the incentive for a voter to act strategically. In particular, if an individual 
votes straightforwardly, then they vote for candidate 2 whenever log(ug — u,) > log(ug — uz). 


This is equivalent to using the optimal voting rule with A = 0. 


1.4. ILLUSTRATION 


1.4.1 Strategic Voting with Certain Candidate Shares 


The key assumptions behind the Duvergerian conclusions are best understood with reference 
to the following example. To avoid the numeric reference of candidates, the names of the three 
main British political parties have been employed.* Needless to say, the example is pedagogical 


and hence not designed to represent any particular constituency. 


Example 1. In a three candidate plurality election, the electors cast their votes truthfully. 


Voter preferences are independent. The expected vote shares are: 


Candidate 
Conservative 0.88 
Labour 0.82 
Liberal Democrat | 0.30 


In Example 1, the Conservative candidate is identified as the leader. The trailing Labour and 
Liberal Democrat candidates are close, but with Labour slightly ahead. Notice that the as- 
sumption of independent voting probabilities is retained. Equivalently, the voting probabilities 
are known with certainty. Employing the plurality rule, the winner is the candidate receiving 


the most votes. Straightforward voting leads to a Conservative win. 


Consider the introduction of an instrumentally rational voter. As is standard, such an individual 
cares only about their vote insofar as it effects the outcome of the election. Their optimal 
choice may involve switching their vote away from their preferred candidate. Suppose that 
the electorate is of size N. Cox (1997, p. 75) indexes the candidates by j, and denotes the 


“In the British General Election of 1997, the Labour and Liberal Democrat opposition parties challenged the 
incumbent Conservative Party. Curtice and Steed (1997) note that in seats held by the Conservatives, voters 
switched between each of the opposition parties in an attempt to dislodge the unpopular incumbent. Fixing 
the Conservative vote share and examining switching between Labour and the Liberal Democrats provides a 
reasonable specification. See Butler and Kavanagh (1997) for more details of the British general election. It is 
worthwhile noting that in British general elections strategic switching has occurred between all political parties. 
It follows that the model considered here is not exact — rather, it illustrates the key factors influencing the 
decision of a strategic voter. 
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associated voting probabilities by 7;, where the leading candidates are j € {1,2}. He explains 


the standard strategic voting logic as follows: 


“If 0 < mj; < m then candidate j is virtually sure to lose for sufficiently large N, 
and voting for the most palatable of the candidates most likely to be tied for first 
yields a higher expected utility than voting for 7.” 


The first part of this statement is empty — for sufficiently large N, candidate 2 is also “virtually 
sure to lose”. Indeed, in Example 1, with a large electorate, the Conservative candidate will 
win with near certainty — this is a simple consequence of the Law of Large Numbers. The key 
criterion (as Cox correctly recognises) is the probability of a candidate being in tied contention 
for the lead — a pivotal outcome. Although this is a remote possibility, it is the only instance 
in which an extra vote can swing the outcome of the election. This pivotal probability becomes 
vanishingly small as N grows large. An instrumentally rational voter, however, is only interested 
in the relative likelihood of the various possible pivotal possibilities given that a pivotal outcome 
actually occurs — if a pivotal outcome does not occur, a single vote carries no weight and hence 
such situations are irrelevant. These are conditional pivotal probabilities — for instance, the 


probability of a Conservative-Labour tie®, given that a tie between any two candidates occurs. 


To sharpen the analysis, place Example 1 into the model of Section 1.2. Fix the Conservative 
vote at x; = yn, and the total electorate at N = (1+71)n, where 7; = 0.38/0.62. Inspecting 
the specification of Example 1, it is clear that any pivotal outcome must involve the Conser- 
vative candidate. Moreover, one might intuitively expect that a Con-Lab tie for the lead is 
more likely than a Con-LibDem tie. With this in mind, a Liberal Democrat supporter might 
switch to Labour in an attempt to prevent a Conservative win — or indeed vice versa. In a 
decision theoretic setting, one can thus expect at least some strategic voting. This leaves open 
the question of whether this strategic voting will be complete. Clearly, a Liberal Democrat 
supporter will need to balance a preference for their first choice against the relative likelihood 
of the different tie probabilities. Since the Labour and Liberal Democrat candidates are “close” 
in Example 1, one might expect the conditional pivotal probabilities to be close, and thus 
a sufficiently committed Liberal Democrat may find it optimal to vote truthfully. Intuition, 
however, is insufficient to reach this conclusion — calculation of the relevant probabilities is 


necessary. 


Figure 1.2 presents the conditional pivotal probabilities for Example 1. They are calculated for 
a range of electorate sizes. Even for small electorates it is far more likely that a Con-Lab tie is 
observed than any other tie for the lead. In fact, for n = 500, a Con-Lab tie is approximately 
1500 times more likely than a Con-LibDem tie. For larger n, any tie is virtually certain to be 


between the Conservative and Labour candidates. 


’Henceforth tying pairs will be referred to by the abbreviations Con-Lab, Lab-LibDem and Con-LibDem. 
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Figure 1.2: Pivotal Probabilities for Certain p = y2 = 0.32/0.62 


Lab-Con tie 


-—---  LibDem-Con tie 
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Figure 1.3: Pivotal Probabilities for Uncertain p 
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This example clearly illustrates the driving force behind the strict version of Duverger’s Law. 
However close the trailing parties are, as the electorate size grows large the conditional pivotal 
probability of a tie between the leading two parties approaches 1. Almost all instrumentally 
rational voters will thus switch their vote to their preferred candidate from the two leaders. 
Notice that game theoretic reasoning is unnecessary — a Duvergerian outcome obtains without 
the need for a voter to consider the strategic actions of others. Of course, if they do consider this, 
it only serves to reinforce the effect. Essentially, with a large electorate there is no uncertainty 
remaining in the mind of a voter. All of the idiosyncrasy of individual electors is averaged out 


by the electorate size. 


1.4.2 Strategic Voting with Uncertain Candidate Shares 


The conclusions of Example 1 are both unintuitive and unattractive. For an unboundedly 
large electorate, the model implies that voters are certain that the Conservative and Labour 
candidates are the leaders, and moreover are certain that the Liberal Democrat candidate will 
never be in contention for the lead. Certainty over voting probabilities may be a simplifying 
assumption, but it is the feature that drives all the results. This observation is illuminated with 


reference to the following extension of Example 1. 


Example 2. Retaining the framework of Example 1, in a three candidate plurality election, 
the electors cast their vote truthfully. Conditional on knowledge of constituency-wide support, 
voters are independent. As before, the Conservative share is fixed at 0.38. There is uncertainty, 
however, over the probability of a voter supporting the Labour candidate, represented by a Beta 
prior over p, with E[p] = y2 = 0.32/0.62. 


In Example 2, the assumption of certain voting probabilities is relaxed. Whereas the Conser- 
vative vote share is known, there is uncertainty over the Labour (and hence Liberal Democrat) 
voting probabilities. The Beta distribution of p is that described in Section 1.2.3, with m = 40, 
Bo = my and 33 = m(1— 2). The expected value of this distribution is y2 = 0.32/0.62. A 
voter in isolation is believed to support Labour with this probability — for a single individual, 
the voting probabilities of Example 1 are retained. This does not extend to multiple voters. 
If a voter is known to support Labour, this is a signal of a higher Labour support probability. 
An additional voter would be expected to support Labour with probability greater than yy. It 
follows that the preferences of electors are no longer independent. This invalidates the applica- 
tion of the Law of Large Numbers. As a consequence, the outcome of an election (with truthful 
behaviour) is no longer certain in large electorates. The focal voter believes that the Labour 


candidate is the most likely challenger, but is not certain that this is so. 


Once again, it is not the voting probabilities that determine the behaviour of an instrumentally 
rational voter, but rather the conditional pivotal probabilities. Figure 1.3 plots these probabil- 
ities for Example 2. Notice that the conditional probability of a Con-LibDem tie does not fall 


to zero. Of course, a Con-Lab tie is more likely than a Con-LibDem tie, and hence there will 
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be at least some incentive for strategic voting from the Liberal Democrat to Labour. Since the 
possibility of LibDem contention is still open, however, a sufficiently partisan Liberal Democrat 
supporter will find it optimal to vote truthfully. Strategic voting is not complete. It follows 


that, in a decision-theoretic environment, the strict version of Duverger’s Law fails. 


Figure 1.3 suggests that the conditional pivot probabilities tend to some finite limiting value 
as the constituency size grows large. Moreover, this limit is approached for relatively small 


constituency sizes. The analysis of Section 1.5 investigates. 


1.5 PIVOTAL PROPERTIES 


Section 1.3 highlighted the pivotal log-likelihood ratio \ = log(qi/q2) as the clear statistic of 
interest. Moreover, the second illustrative example of Section 1.4 suggested a finite limiting 
value for A for large constituencies. This section provides a formal investigation of its limiting 


properties. The first step is the evaluation of the pivotal probabilities {q;}. 


Lemma 2. The pivotal probability qo, conditional on the information of a typical voter is: 


= mn : { 7117 l-—p V1 
© = oa I [p™ (1 — p)-™] {1+ ; tae} fee (1.2) 


A symmetric expression holds for qs. 


Proof. Simple application of the binomial distribution — see Section 1.8. O 


Notice that the pivotal probabilities q; vanish as n grows large — with a large electorate, the 
probability of observing a tied outcome becomes vanishingly small. Rather than these pivotal 
probabilities, it is the log-likelihood ratio A that is of interest. With uncertainty over p, this 
converges to an attractive expression. The following proposition is the central contribution of 


this part, and will be key in the game-theoretic analyses of Parts 2 and 3. 


Proposition 1. Allowing the electorate size to grow large: 


Examining Equation (1.2), notice that the integrand vanishes to zero due to the leading term 
[p+(1 — p)!~-7]”". As n grows, this expression becomes increasingly more peaked around its 
maximum. This maximum occurs at p = 71, and hence only density at this point contributes 
any weight. By contrast, the integrand of q3 is maximised at 1 — 7. A formal proof draws 


upon from the following lemmata. 


Lemma 3. For large constituencies, there is equal probability of a tie or near-tie: 


fies ses wg 
noo Pri[a; = yn — 1 
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Proof. The formal proof mimics that of Lemma 4 — see Section 1.8 O 


Lemma 4. For large constituencies, the likelihood ratio of exact ties tends to f(y1)/f(1-—1): 


(hie = al = es 


Proof. Begin by recalling the formulae for Pr{av2 = yn] . It follows that: 


Prix =n] fo (P= —p)!-™]" fe)dp fo [p" (1 —p)'-™]” f@)dp 


Prizg= 17] [((1—p)p!-1]" f(p)dp ff) [pn (1 — p)!-]” £1 — p)ap 


where the second inequality follows from a simple change of variables in the denominator. 


Introduce the notation G(p): 


G(p) = Gee Pri =nnl _ _b Ge)" Fe)dp ag) 
at (1-—)1-% Pri[z3 = yn] Wee G(p)" f(1 — p)dp 

Notice that G(p) is increasing from G(0) = 0, attaining a maximum of G(y1) = 1 at p=, 
and then declining back to G(1) = 0. Next fix 1/2 > e€ > 0. For notational convenience, define: 


fiw) = inf f) 
x—e<p<a+e 
fie(@) = sup f(p) 
e—e<p<ar+e 
fu = sup f(p) 
O<p<1 


Here the assumption that f(p) is bounded above has been used. Employing this notation, 


formulate an upper bound for the ratio in Equation (1.3): 


Prey = yan). fite(n) SEE Oya + Fr [Jo Gap + Li, 4 @D)"ap| 


Pr[zz = 17] — fre(1— 1) [OE G@)rdp 


(1.4) 


The right hand side of Equation (1.4) has three terms. These will be considered in turn. First: 


fan) Jie GR)" a _ _ fuel) 
frel-%). bei G(p)"dp — fre(l-) 


Next consider the second term. G(p) is increasing from 7 — € to 7, and hence: 


“V1 FE “1 
if G(p)"dp > [” G@yap > Gin — 9" 
oy: 


1—e€E SIYVI—E 
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1-1 2 V1 


Figure 1.4: Beta Prior Over p for m = 40 


Taking the second term, and allowing n — on, it follows that: 


Io’ * G(p)"ap — a [ae |'m—o 
0 


SE G(pyndp ~ €. eo 


which holds since G(p) < G(7 — €) for all p < 7 —e. An identical argument ensures that the 


third term vanishes. Conclude from this that: 


lm Pre = HI 2 fre(M) 
N00 Pr[x3 = yn field _ v1) 


Notice now that € may be chosen arbitrarily small. It follows that: 


fre(M) f(m) 


lim = ——— < lim —*——_ = 
noo Prix3 = yn] ~ 0 frel—-m) fd-m) 


Priva = 17] 


A symmetric procedure bounds the limit below, and the result obtains. | 


Combine Lemmata 3 and 4 to obtain the proof of Lemma 2. 


Proof of Proposition 1: Employing Lemmata 4 and 3, it is clear that: 


q _ Priv =nn|d+Prie=yn-1/Prie2=vn)) fin) 


qs Prizwg =in|(1+ Pri~g=yin—1]/Prlzz=yin]) fd- 1) 


This proves the desired result. O 
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A number of observations are pertinent. First, notice that the pivotal log-likelihood ratio 
A converges to a finite value as the constituency grows large. Second — and perhaps more 
importantly — Proposition 1 implies that strategic voting is entirely driven by uncertainty over 
the probability p. It is the uncertainty over the constituency-wide support of the candidates, 
rather than uncertainty over the idiosyncratic behaviour of a particular individual that drives 
strategic voting. Figure 1.4 illustrates this point for Example 2. For large n, X converges to 
the log-likelihood ratio of an exact tie in constituency wide support of Lab-Con (p = 7) versus 
LibDem-Con (p = 1— 1). 


This is a stark contrast to the Cox-Palfrey case. Suppose that constituency-wide support of all 


candidates is known, so that p = 72. Modifying Lemma 2 appropriately yields: 


= n! ae Se =o gi 
®= Gaya nya 2-2] {i+ = tan} 


with a symmetric expression for q3. Constructing the ratio, cancelling common terms and 


taking limits yields, for 72 4 1/2: 


1/2 
lim log 2 _ lim n(2y1 — 1) log ( te ) Se te / (1.5) 
n-00 q3 n—0o 1- 2 -~wo y< 1/2 


It follows that in the Cox-Palfrey case — unlike the encompassing model presented here — the 


incentive to vote strategically is unbounded in large constituencies. 


1.6 COMPARATIVE STATICS 


The Cox-Palfrey model predicts complete strategic switching, and hence a bipartite outcome. 
Since strategic voting is always complete, it cannot change in response to the parameters. The 
only comparative static is discrete — changing the identity of the two leading contenders results 


in a different bipartite outcome. 


Comparative statics are available for the model considered here. This section conducts such 


comparisons with the aid of the Beta prior described in Section 1.2. 


1.6.1 Opinion Polls and Information Precision 


Section 1.2 introduced the Beta prior as a convenient conjugate for the binomial distribution. 
In that section, the precision of the focal voter’s beliefs carried the index m. A microfoundation 


is available, however. Begin with a Beta prior over p: 


ae Fae 1 p)— 1 


f(p) = T( Bs) 
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Suppose that the focal voter now observes an opinion poll of size m. This samples voters 
satisfying uy < min{uy, ug}. Denote the realisation of this random sample by y2 and y3 = m—yp. 


A standard application of the beta-binomial yields: 


TG + 63 +m) 2+B2-1/7 _ »\mM—y2+63—-1 
Bo + yo) T(B3 +m — ya) ne 


f(ply) = ™ 


Suppose that the share of candidate support in the opinion poll is y2, so that yo = yam. 
Assume without loss that 72 > 1/2, so that candidate 2 is expected to be the closest challenger. 


Specialise to a uniform prior, where G2 = 93 = 1. The posterior density becomes: 


I'(m + 2) 
yam + 1)T((1 — 42)m + 1)) 


fel =a pr Leap) Pe (1.6) 
Compare Equation (1.6) to Equation (1.1, p. 6). The precision index m can thus be viewed 
as the size of an opinion poll observed by the focal voter; the modification in Equation (1.6) 
is due to the inclusion of the flat prior over p. For a large electorate (n — oo) the pivotal 


log-likelihood ratio becomes: 


f(m) _ = PaaS V1 _ 7 eis v1 
Foy = Oe -D oe ( ) =o ns) os ( -) ae 


A= lo 
Lit 


where 73 = 1 — 7. Inspecting Equation (1.7) yields the following proposition. 


Proposition 2. Strategic voting incentives are increasing in (a) the strength of candidate 1, 


(b) the perceived gap between candidates 2 and 3 and (c) the precision of information m. 


The comparative statics obtained here are mirrored by those obtained in the fully game-theoretic 
models of Parts 2 and 3. Indeed, as Section 2.1 notes, they challenge established intuition to 
some extent. Notice that in a “close” election, there is little incentive for strategic voting. 
Allowing 71 | 1/2 and y2 — 43 | 0 results in A | 0. Perhaps the most important conclusion, 
however, is the insight offered into the role of information precision. The pivotal log-likelihood 
ration A becomes unboundedly large as m increases. This effect is illustrated in Figure 1.5. 


More formally, taking limits: 


V1 
Lyi 


m—- co 


lim A= im. m(y2 — 93) log ( ) =+oo for min{y1,y2} > 1/2 (1.8) 
Notice the close correspondence between Equation (1.8) and the Cox-Palfrey limiting case 
in Equation (1.5). There are two key differences, however. First, the rdles of 7; and 72 are 
interchanged. Secondly — and more importantly — the Cox-Palfrey framework suggests that an 
unbounded constituency size is the key ingredient for a strict Duvergerian conclusion. Equation 
(1.8) demonstrates that this is not the case. Rather, it is an unbounded opinion poll size that 


is required. 


Of course, as Figure 1.5 clearly illustrates, a relatively modest opinion poll is sufficient to obtain 
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m = precision of information 
Figure 1.5: Conditional Pivotal Probabilities for n — co and Varying m 


a near-Duvergerian outcome. This suggests that, in a decision-theoretic environment, one might 
expect near-complete strategic voting. Such a conclusion is in error. Part 3 argues that voting 
takes place at a constituency level, whereas high-precision public information sources (such as 
opinion polls) are typically seen at a national level. It is reasonable to expect the precision of 
information on constituency standings to be rather lower. In this case, the decision-theoretic 


model leads to a non-Duvergerian conclusion. 


1.7 CONCLUSION 


This first step argues that the key assumption driving the strict Duvergerian results of earlier 
formal voting models is the assumed independence of voter preferences. In a decision-theoretic 
framework, a non-Duvergerian outcome emerges. Perhaps most importantly, constituency un- 
certainty rather than idiosyncratic uncertainty is the key determinant of a strategic voter’s 


decision. 


Issues remain. The analysis presented here is decision-theoretic — voters do not take the strate- 
gic decisions of others into account. A game-theoretic perspective is required. Indeed, one might 
expect strategic voting to be self-reinforcing, and hence the possibility of fully Duvergerian game 
theoretic conclusion remains open. Furthermore, the constituency-level uncertainty lacks a mi- 
crofoundation. Finally, the information sources available to the electorate have yet to be fully 


explored. 
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These issues are addressed in the following partss. Part 2 provides a foundation for constituency 
uncertainty, by breaking voter preferences into common and idiosyncratic components. More- 
over, direct modelling of preferences allows a full game-theoretic analysis. The insights carry 
through — with sufficient precision a Duvergerian conclusion emerges. Part 3 generalises the 
model further with the introduction of privately observed signals. It argues that, at a con- 
stituency level, this is a more plausible framework. Surprisingly, it is shown that strategic 


voting exhibits negative feedback, and is self-attenuating. Non-Duvergerian conclusions follow. 


1.8 OMITTED RESULTS 


1.8.1 Pivotal Probabilities with Uncertain p (Section 1.4.2) 


This section outlines the procedure used to generate Example 2 and Figure 1.3. The prior 
distribution over p is the Beta distribution, with parameters 32 = m7 and 33 = m(1— 42), 


where for illustration m = 40. A standard application of the distribution shows that: 


Tv=yn| = as ee eee ; 1n+B2-1(1 _ »\A-y1)n+63-1 

P 2=%71 = T'(4)T'(63) P(b. 3) (qin)! (1—)n )! if D - (1 P) er dp 
a oa + 3) D(n +1) Dijin + P2)P (1 — y1)n + Bs) 
~ -T(G)P (83) Poin + YP = 1)n +1) T(Go + 83 +7) 


It follows that: 


Pri[z2 =n] — T(yin+fe)T((1-—1)n + Bs) 


Prizg =n] = (yin + 63)P((1 — y1)n + fe) 


Figure 2 plots this ratio for varying n. This is not the exact ratio q/q3, since it omits near 


ties. Inclusion of exact ties in this illustration has a neglible effect. 


1.8.2 Omitted Proofs from Section 1.5 


Proof of Lemma 2: Recall from Section 1.3 that optimal voting behaviour is contingent upon 
the relative probabilities of a tie for the lead. Such a tie requires y,n voters to coordinate on a 


candidate j € {2,3}. In addition, the event of an “almost tie” is also of interest. Thus: 
qj = Prryn-l<aj <n 


As in Section 1.2, denote by p the (conditionally independent) probability that a voter supports 
candidate 7 = 2. Thus, conditional on p, the vote total xg is binomially distributed with 


parameters p and n: 


n! 
Relea il eg saaia 


prdep =)" 
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From the perspective of the focal voter, p is unknown. It follows that: 


n! , (ple 
Pr[z2 = yin] = eal [p™(L—p)'"]" f(p)dp 


Furthermore: 


1 


n! n nl—- 
ae Soap 


(yn)! (1—-j)n TEER [p™ (1 — p)' 11] 


Combining these yields the appropriate expression. It follows that a symmetric expression holds 


Prizg = yin - 1] = 


for q3. O 


Proof of Lemma 3: Employ the notation used in the proof of Lemma 4. Hence, for 7 = 2: 


Pri[ap = yin - J _ v1 he p)” = f(p (p)dp 
Pr[z2 = 17] a pen )" f(p)dp 


The limit of the first term is y,/(1 — 7). An upper bound for the second term is: 


frre (on) [Rit G(vy Bap + fin [Jo * Geo" Bap + [ope (vy Bd 


VIE 
fel) [os G(p)"dp 


Taking the first term in this expression: 
fire(n1) SS Gp)" 2 fp) dp _ lente fun) 
fie(), Gia G(p)"dp ~ mae frelt) 


Next consider the second term: 


fee G(p) 
; G 


[P“eG(prdp ~ ©, (m —€) 


n 1 _ 
| P dp — 0 
Y1—€ P 


Similarly this holds for the second term. It follows that: 


lim 2@2=ur—d - a { a ee a 
neo Pria, = yn] esoo (La: a Se. frei) 


Once again, a symmetric procedure bounds the limit below, and the result obtains. | 


PART 2 


STRATEGIC VOTING WITH CONSTITUENCY UNCERTAINTY 


“In these matters the only certainty is that there is nothing certain.” 


Pliny the Elder 


2.1 INTRODUCTION 


2.1.1 The Roéle of Constituency Uncertainty 


Farquharson (1969, pp. 57-60) notes that interest in strategic voting dates back to Pliny 
the Younger. The comment of Pliny the Elder recorded here is perhaps apt in advising the 
modeller in the pursuit of its analysis. Indeed, the critique of Part 1 suggests that uncertainty 
over constituency-wide support of candidates is the key missing ingredient in the Cox (1994) 


and Palfrey (1989) models of strategic voting. 


The decision-theoretic analysis of Part 1 leads to a non-Duvergerian conclusion. Such an anal- 
ysis is incomplete, as it omits the effect of strategic voting by others on an individual’s voting 
decision. A game-theoretic perspective is required, necessitating a full specification of voter 
preferences. Furthermore, the source of constituency-uncertainty is unclear. A microfounda- 


tion for this uncertainty is desirable. 


To address these issues, the simple model of a three candidate plurality election described in 
Part 1 is extended. Voter preferences are fully specified. The model has much in common 
with those of Cox (1994) and Fey (1997), employing a random utility framework. The key 
innovation is the presence of a common utility element to the payoffs of each voter. The value 
of this common element is uncertain. This, in turn, induces uncertainty over the distribution 
of preferences in the population, and is equivalent to uncertainty about the popular support 
of candidates. It also captures the notion that there are constituency-wide factors which affect 
the preferences of the whole electorate, as well as idiosyncratic factors that are voter-specific. 


The remainder of this section discusses the approach in more detail. 
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2.1.2 Constituency-Wide Common Effects 


When will the independence of voting decisions fail? The preference specification of the new 
theory suggests a possible answer. Preferences are decomposed into common and idiosyncratic 
components. The common component affects all voters in a constituency. Idiosyncratic compo- 
nents are individual to a voter, and distributed independently across the electorate. This implies 
that, conditional on the common preference component, voting decisions are independent. This 
common component is, however, unknown to an observer — including the members of the elec- 
torate. It follows that unconditional preferences (and hence voting decisions) are correlated. 
The decision of one individual provides a signal of the common utility component, and hence 
the voting decisions of other individuals. Existing models (Cox 1994, Palfrey 1989, Fey 1997) 
are equivalent to this specification when the common component (and hence constituency-wide 


support) is commonly known. 


Part 1 argues that it is only uncertainty over constituency-wide support that drives strategic 
incentives. Indeed, with the microfoundation of common utility elements, the analysis here 
shows that it is uncertainty over common effects that matters. Once again, individual idiosyn- 
crasies are averaged out in a large electorate, and hence do not influence the likelihood ratios 


of tying pairs. 


2.1.3 Self-Reinforcing Strategic Voting 


Adopting a game-theoretic perspective, the possibility of a strict Duvergerian outcome remains 
open. Strategic switching from trailing candidates and toward the leading pair reduces the 
probability that trailing candidates are pivotal. This further increases the incentives of instru- 
mentally rational voters to switch their support to the leading pair. Further switching reinforces 
this effect. Where will such a process lead? There are two possibilities. First, the process might 
continue until all support for trailing candidates is lost. Alternatively, the process may converge 
to an equilibrium in which the trailing candidates retain some attenuated support. Untrained 
intuition is insufficient to determine which of these situations obtain, and a formal analysis is 
required. Such analysis necessitates a focus on the information source on which voters base 


their decisions. 


To explore this issue, voters are equipped with a public information source. All individuals 
commonly observe a public signal of the common component of preferences — a public signal 
of constituency-wide support for the candidates. This may be likened to the observation of 
an opinion poll, and in fact a detailed opinion poll provides the microfoundation. Naturally, 
voters have at least some privately observed information to assist them in their decisions. 
Absent communication with any specific outside source, a voter might engage in introspection, 
examining their own preferences to elicit a signal of the common component. In this part, 
such factors are ignored, and voters are assumed to discard their personal preferences when 


formulating their beliefs. This omission is rectified in Part 3, with somewhat surprising results. 
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With pure public information, where voters ignore any private signal, strategic voting is indeed 
self-reinforcing. In the three-candidate model considered here, the analysis identifies three equi- 
libria. The two Duvergerian equilibria entail complete coordination on one of the challenging 
candidates, yielding a bipartite outcome. There is also a unique non-Duvergerian equilibrium. 


An equilibrium selection problem is posed. 


Equilibrium selection is made according to a stability criterion. If the public signal is suf- 
ficiently noisy, then the non-Duvergerian equilibrium is uniquely stable. Beginning with an 
initial “straightforward voting” hypothesis and engaging in iterative best response, strategic 
voting exhibits positive feedback, but peters out as an interior equilibrium is attained. By 
contrast, when the public signal is sufficiently precise, strategic voting is explosive, and all 
support for the trailing candidate is lost as the best-response process converges to a Duverge- 
rian equilibrium. Existing models correspond to this latter Duvergerian case — a Duvergerian 
equilibrium requires public information of sufficient precision. In both cases, strategic voting is 


uni-directional. 


This selection device mimics the repeated election dynamic described by Fey (1997, pp. 142— 
146). Indeed, as voter beliefs are allowed to become unboundedly precise, the Fey model is 
obtained. Notice, however, that repeated elections are unnecessary to achieve the desired selec- 
tion. Beginning with an initial hypothesis of straightforward voting, the deliberation dynamic 


alone is sufficient to attain an equilibrium outcome. 


In a Duvergerian election, the analysis of this part lends qualified support to strict bipartism. 
This requires, however, sufficient precision of voter beliefs and hence sufficient detail in a public 
signal. As will become clear in Part 3, this Duvergerian conclusion is quickly overturned 


following the introduction of private information. 


2.1.4 Comparative Statics 


The comparative statics of the game-theoretic model mirror those of the decision-theoretic 
model of Part 1. In particular, there is a key réle for the precision of information. If information 
is sufficiently precise, then a fully Duvergerian outcome obtains. In the non-Duvergerian case, 


strategic voting is increasing in the precision of the public signal. 


As in Part 1, the impact of the expected strengths of the various candidates may be ascer- 
tained. As one might expect, the incentive to vote strategically is increasing in the expected 
gap between challenging candidates. Informal intuition has previously suggested that strategic 
voting increases as the lead of a leading candidate weakens. Cain (1978, p. 644) provides a 


classic example in his analysis of strategic voting in Britain: 


“we would expect the level of third-party support to be lower in competitive (i.e. 
closely contested) than in noncompetitive constituencies (i.e. one party dominant), 
since the pressure to defect and case a meaningful vote will be greater in constituen- 


cies with close races.” 
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This intuition fails. In fact, the incentive to vote strategically increases with the strength of a 
leading candidate. Intuitively, greater coordination is required to defeat this leading candidate, 
and this increases strategic incentives. Cain’s argument appears to be based on the absolute 
probability of influencing the election outcome. But as the analyses of both the new theory and 
its predecessors demonstrate, it is only the relative likelihood of tied outcomes that determines 


the optimal voting decision. 


In this fully-specified model, an additional parameter is available — the heterogeneity of voter 
preferences. The initial (decision-theoretic) incentive to vote strategically is unaffected by the 
heterogeneity of voters. The amount of strategic voting, however, increases as the heterogeneity 
of the population is reduced. Intuitively, if there is greater commonality of opinion, then only a 


limited incentive is required to encourage a large number of individuals to switch their support. 


2.1.5 Guide 


The argument continues in the following sections. Section 2.2 extends the model of Section 1.2 
to include a full specification of voter preferences, as well as the public signal observed by voters. 
Section 2.3 investigates the optimal behaviour of voters, and calculates the realised pivotal log- 
likelihood ratio given such behaviour. This allows an immediate decision-theoretic analysis. 
Equilibrium considerations are introduced in Section 2.4, where the perceived and realised 
pivotal log-likelihood ratios are equated. Possible equilibria are classified, and equilibrium 


selection is made. 


2.2. MODEL 


2.2.1 The Election 


The specification of the election is identical to that of part 1. The reader is referred to Section 
1.2 of that part for full details. There are three candidates j € {1,2,3} in a single-seat district 
selection, operating under the plurality rule. Realised vote totals will be denoted {x;}, where 
Sia; =(1+71)n+1. The vote total for candidate 1 is fixed at x] = yn, and remaining voters 
prefer both of candidates 2 and 3 to candidate 1. Assumption 1 is upheld (y, > 1/2), ensuring 
that any tie for the lead involves candidate 1. All references to a “voter” and the “electorate” 


will refer to the n + 1 remaining voters. 


2.2.2 Preference Structure 


A full specification of voter prefereces is unnecessary for a decision theoretic analysis. Given 
the beliefs of a focal voter (represented in Part 1 by the density f(p)), the pivotal log-likelihood 
ratio A may be calculated. The strategic incentive may then be investigated relative to the 


fixed preferences of any particular focal voter. 
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In a game-theoretic environment, additional structure is required. A focal voter must consider 
the probability of a strategic switch by other voters. This probability depends on both the 
perceived log-likelihood ratio and the distribution of voter preferences. To address this need, a 
random utility model is adopted. The utilities include common effects, which individual voters 


do not observe. Once again, the restriction will be to the n voters satisfying uj; < min{uj2, U3}. 
Notation 1. Voter i has von Neumann-Morgenstern utility uj; for party j. This satisfies: 


log 2 =H+ 6 (2.1) 
UiZ — Ui 


Refer to n and €; as the common and idiosyncratic components of utility. 


Notice that the left hand side of Equation (2.1) gives a scale-free measure of the preference 
for candidate 2 relative to candidate 3, each measured relative to the benchmark of candidate 
1. Section 1.3 of Part 1 demonstrates that this measure is sufficient for the optimal voting 
decision. Without loss of generality, it is assumed that 7 > 0, so the actual constituency-wide 


support for candidate 2 is greater. 


Consider next the decomposition on the right hand side of Equation (2.1). An easy inter- 
pretation is that 7 represents constituency-wide factors affecting all voters. By contrast, «; 
represents the idiosyncratic preference of voter i. More generally, 7 is the expectation of 
log (ua — ui1) — log(wiz — ui) conditional on all district-level information, generating the resid- 
ual component ¢;. This residual will be independent across voters. In addition, a parametric 
specification is employed. This is not critical to the argument, but rather allows an easy mi- 


crofoundation for the public signal described in Section 2.2.3. 


Assumption 3. The idiosyncratic component ¢; is distributed normally: €; ~ N(0,€?). 


If the common component 7 is fixed and commonly known, then voters’ preferences are inde- 
pendent. Such an assumption yields the models of Cox (1994) and Fey (1997). In particular, 
it implies certain knowledge of constituency-wide preferences and hence the election outcome. 
To see this, notice that the probability that a voter’s first preference is for party 2, conditional 
on 1, is y2 = ®(n/E), where ® is the cumulative distribution function of the standard normal. 
Defining 73 = 1 — 7, it follows that, with truthful voting, E[z;] = y;n. Indeed, fixing the 
constituency-wide truthful support levels for each candidate, the common utility component 
satisfies 7 = €67!(y9). 


If 7 is fixed and known, then the expected (truthful) support for all candidates is commonly 
known throughout the constituency. In a large constituency, enforcement of the Law of Large 
Numbers reveals that the outcome of any election with truthful voting is commonly known by 
all constituents — a restrictive assumption indeed. The key innovation of the new theory is the 
introduction of uncertainty over 7, and hence uncertainty over the support for each candidate. 
This provides a microfoundation for the uncertainty over voting decisions represented in Part 
1 by the density f(p). 
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2.2.3 Public Signals 


Voters begin with a diffuse prior over 7 — they have no knowledge of average candidate support. 
Information on candidate support may then be gleaned from a variety of sources. This part 
specialises to the case of public signals. One important source of public information is the 
publication of opinion polls and similar surveys. Formally, this is modelled as the receipt of a 


public signal, equal to the true value of 7 plus noise. 


Assumption 4. Voters commonly observe a public signal 1 ~ N(n,o7) . 


Following observation of this signal, and prior to the receipt of any private information, voters 
update to a common public posterior 7 ~ N(ju,07). A microfoundation underpins Assump- 
tion 4. Suppose that the preferences of m, randomly chosen inviduals are publicly observed. 


Indexing these individuals by k € {1,2,... , my}: 


1 A up — un ‘Ss 
b= —— > leg hN ( — (2.2) 
My i Uk3 — Uri ™y 


Thus a public signal with variance 0? = €? /m, is equivalent to the public observation of 
m, individuals. The generation of Equation (2.2) employs the distributional specification of 
Assumption 3. For large m,, however, the Central Limit Theorem suggests the normal as an 


appropriate specification for the distribution of pu. 


Viewed as an opinion poll, Assumption 4 provides a natural framework. In particular, the 
widespread publication of opinion polls is common during an election. Implicit to this interpre- 
tation, however, is the assumption that sampled individuals reveal their true preferences to the 
pollster. This issue is neglected here, and for good reason. As Part 3 argues, it is information 
on the district level candidate support that is of interest. Opinion polls are rarely conducted 
at the constituency level in countries such as the United Kingdom, and hence it is perhaps 
privately observed information that is of more importance. It is argued that, in this case, 
any signal is elicited from the observation of society over a longer time period, with reduced 
likelihood of strategic manipulation of viewpoints on the part of respondents. Of course, this 
justification immediately calls into question the relevance of a model based entirely on public 
information signals. The aim of this part is to provide a public-information benchmark to the 
full model of Part 3. The reader is invited to view the present formulation in this light, and 


reserve judgement until the conclusion of the argument in the following part. 


Of course, even in the absence of any explicit private signals, a private information source is 
implicitly available. As Section 2.1 noted, an individual’s own payoffs provide a private signal 
of the common utility element. Any rational voter will employ all information available to 
them in order to reach a decision. Pursuing the benchmark justification still further, however, 
this element is neglected here. Indeed, voters are explicitly assumed to form beliefs over the 


common component 7 using only the public signal. It follows that all voters share a common 
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public posterior belief: 


heey N (1, 02) 


This posterior may be obtained by beginning with a common public prior, Bayesian updating, 


and allowing the precision of the prior to vanish. 


2.3. VOTING BEHAVIOUR 


2.3.1 Optimal Voting 


As Section 1.3 observes, a vote can only influence the outcome of the election if it is pivotal. A 
pivotal situation arises with a tie or near-tie for the lead. Lemma 1 deduced the optimal voting 
rule for a voter, conditional on the perceived probabilities gz and q3 of either a 1-2 tie or a 1-3 
tie. Indeed, recalling Definition 2 of the pivotal log-likelihood ratio A = log(q2/q3), this optimal 
voting rule satisfies: 


oe ca 


ai 
Vote2 = log—> 4 
Ui3Z — Ui1 


The analysis focuses on symmetric strategy profiles. This is without loss of generality, since 
voters will act optimally, and their beliefs will be based entirely on the public signal jz. Condi- 
tioning on the perceived log-likelihood ratio A, which is common to all voters, and the common 


utility component, an individual supports candidate 2 whenever: 
PREENEO (2.3) 


This yields the following simple lemma. 


Lemma 5. Conditional ony and 4, a voter supports candidate 2 with probability: 


Ug — Ua g 


where ® is the cumulative distribution function of the standard normal. 
Proof. Conditional on 7 and A, €; ~ N(0,€?). Re-arrange Eq. (2.3) to obtain the result. O 


This implicitly assumes that A 4 too. These cases will be considered in Section 2.4. 


Suppose that all voters share a common perception A. What will be the realised pivotal log- 
likelihood ratio \? A symmetric strategy profile yields an identical perception A of the log- 
likelihood ratio of pivotal probabilities for each voter. Fixing this perception, and hence the 
strategy profile, yields a realised log-likelihood ratio \. This depends in turn on the probability 
that a voter supports candidate 2 (p), and uncertainty over this probability. Represent this 
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uncertainty by the density f(p), and assume for now that this is bounded above. Allowing the 


constituency size n to grow large, Proposition 1 from Part 1 reveals: 


Proposition 3. Allowing the electorate size to grow large: 


Jim, \= log En 


This requires evaluation of the density f(p). Uncertainty over p is driven by uncertainty over 


y, and yields a convenient formula for De as the next proposition reveals. 


Proposition 4. With pure public information and X 4 +00 the realisation A satisfies: 


ha) = 2 Mn+ e) (2.4) 


Proof. Conditioning on 7 and A, Lemma 5 reveals that: 
ere - 
p=Pr log 2" +a > ol] =o (>) 
UWi3 — Util 
Hence: 


F(p) = Pr jo (>) . Hl =Pr[n < €@""(p) — A) =@ oes 


Differentiate this expression with respect to p to obtain f(p): 


og £-1(p) — v— ps 
fp) = sae ( o ) 


Notice that this density is bounded above, as required. Next, evaluate this density at y, and 
1—7. Exploiting the symmetry of the normal distribution, note that ®-!(1—71) = —®7!(y). 
Furthermore, ¢(z) = ¢(—z). It follows that 6(®71(1 — 7)) = ¢(®7!(%)). Taking ratios: 


iG ae es (aw PUP COr aa a) 
) 20? 


Simplifying the numerator in the exponent and taking logarithms yields the desired result. O 


2.3.2 Decision-Theoretic Voting 


This proposition immediately allows a decision-theoretic conclusion. McKelvey and Ordeshook 
(1972) consider the optimal response of a strategic voter, assuming that all other voters act 
straightforwardly. A straightforward voter chooses to vote for their preferred candidate. Natu- 


rally, this is equivalent to voting optimally with a perceived pivotal log-likelihood ratio of A = 0. 
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Employing Proposition 4, a realised log-likelihood ratio Apr obtains: 


Apr = \(0) = ue “ye 
a 
This provides a measure of the incentive for a decision-theoretic strategic voter to switch sup- 
port away from a preferred candidate, given the assumption that all other voters act straight- 
forwardly. This measure requires an appropriate normalisation. First, recall that the micro- 
foundation for public signals specified that 0? = €?/m,,, where m, is the size of a publicly 
observed survey of voter preferences. This yields: 
Apr = cee) 
€ 
It is immediate that the incentive to vote tactically is increasing in the precision of public 
information (m,), the strength of the first candidate (7) and the perceived distance between 
the second and third candidates (yu). It is tempting to conclude that the incentive is also 
decreasing in idiosyncratic variability (€). This conclusion requires more thought, however. 
Varying € also changes the expected (truthful) support of candidate 2, since 72 = ®(n/&). A 


more appropriate comparative static is to fix y2, and allow 7 to vary with €. In this case: 


2my~O | (q1)n 
g 


Similarly, the variance of Apr is invariant to €. It follows that € has no direct effect on the 


E|Apr] = = 2m, (JO (92) (2.5) 


incentive to vote strategically. More interestingly, however, the probability that a voter acts 
strategically is decreasing in €. To see this, note that an individual acts strategically whenever 
n+et+Apr >0>n+6. In particular, the “average” individual (6; = 0) always votes 
strategically. For small €, many more inviduals satisfy this inequality, for fixed 7 and Apr. In 


fact, conditioning on an accurate signal realisation of 4 = 7, the probability of a strategic vote 


0 (49 (14 OD) 


is: 


This is clearly decreasing in €. 


Additional observations are pertinent. First, notice that in a “close” election, the incentive for 
(and amount of) strategic voting is small. An election becomes more marginal as 7 decreases to 
1/2. It follows that ®-'(y7,) | 0, and the incentive to vote strategically disappears. Secondly, as 
candidates 2 and 3 become closer, 7 | 0, and the expected incentive again disappears, with the 
probability of a strategic vote. The model predicts less strategic voting in a close election. Third, 
notice that the incentive (and hence amount) of strategic voting is finite. Uncertainty over 7 
leads to a bounded likelihood ratio of pivotal probabilities. The decision-theoretic outcome is 


thus distinctly non-Duvergerian in nature. The final observation is to note that the decision- 
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theoretic outcome tends to the Duvergerian case as the precision of public information increases. 
More formally: 
u>O => lim =+00 
My—>oo 
It follows that the Duvergerian conclusions of previous decision-theoretic analysis hinge on 
exact precision of public information — equivalent to complete and common knowledge of 


constituency-wide preferences. These results mimic those of Part 1, with the addition of fully 


specified voter preferences. 


Of course, full insight requires a game-theoretic analysis. In the next section, the argument 


considers the properties of equilibria in this model. 


2.4 EQUILIBRIUM 


2.4.1 Equilibrium Voting 


The decision-theoretic analysis considered the nature of optimal voting, assuming that other 
voters act truthfully. A rational voter will anticipate strategic behaviour by others, leading to 
a revision of the perceived log-likelihood A. Linking the optimal voting rule with the standard 
consistency requirement yields a Bayesian Nash equilibrium. The previous section was con- 
cerned with finite A, and first attention must turn to the possibility of extreme Bayesian Nash 


equilibria. 


Proposition 5. There exist two Duvergerian Bayesian Nash equilibria, with x2 =n+1 and 


r3 =n+1 respectively, resulting in a certain win by either candidate 2 or candidate 3. 


Proof. Suppose that all n + 1 voters satisfying uj; < min{uj2, U3} vote for candidate 2. The 
result is a certain win by candidate 2. There is exactly zero probability of a tie, so that 
g2 = q3 = 0. Voters receive the same expected payoff from a vote for either party j € {2,3}, 


and there is no incentive to deviate. An identical argument establishes that 73 = n + 1 is also 


an equilibrium. | 


Under Duvergerian voting, the log-likelihood ratio \ is undefined, since there is exactly zero 
probability of a tie for the lead. The key question of interest is the existence of non-Duvergerian 
Bayesian Nash equilibria. In this case, with finite A, the consistency of voter perceptions and 


the realised distribution over voter outcomes may be imposed. 


Proposition 6. There is a unique symmetric non-Duvergerian equilibrium X* satisfying: 


ye — 282°) 
a? — 26O-1(91) 


(2.6) 


Part 2. Strategic Voting with Constituency Uncertainty 3l 


Proof. Set * = \(A*) in Equation (2.4) and solve. O] 


In common with the decision theoretic case, this non-Duvergerian equilibrium entails no strate- 
gic voting whenever the lead of candidate 1 is low (7 | 1/2) and when the expected support 
for candidates 2 and 3 is the same (ju = 0). Two issues remain, however. First, an equilibrium 
selection problem arises. Second, the direction of strategic voting is unclear. In the decision- 
theoretic case, ps > 0 led to Apr > 0 and hence strategic voting away from candidate 3, as one 
might expect. The same conclusion is not immediate here. The denominator of Equation (2.6) 


may be either positive or negative, and the direction of switching is ambiguous. 


2.4.2 Equilibrium Selection 


The resolution of both of these problems is aided by the following definition. 


Definition 3.An election is Duvergerian if 0?< 2€@—!(y,), non-Duvergerian if 02> 2€®7!(4). 


The issue of equilibrium selection is first addressed. Suppose that voters have an initial hy- 
pothesis for decision rule employed by the electorate, yielding log-likelihood ratio Ag — for 
example, truthful voting corresponds to Ay = 0. A (decision-theoretic) best-response to this 
yields a realised log-likelihood ratio of Ay = Ao): A sophisticated voter may anticipate this 
response throughout the electorate. If all voters anticipate similarly, the realised log likelihood 


ratio becomes Ag = AA). Continuing iteratively, a best-response process is obtained. 


Definition 4. Define the iterative best response process by \y = OG): 


Previous authors (including Fey 1997) specified a process of repeated elections. Voters observe 
the outcome of an election, and adopt a best response to the behaviour represented by the 
election outcome. In a second election, more strategic voting is observed. This process is 
reinforcing, leading to Duvergerian equilibria. With instrumentally rational agents, repeated 
elections are unnecessary for this process. A rational voter will anticipate the strategic actions 
of others, and further anticipate their reaction. Naturally, the limit of such a sophisticated 
reasoning process requires the specification of a starting point. An initial hypothesis of truthful 


voting provides a reasonable and intuitive focal point for such an initial condition. 


Proposition 7. In a non-Duvergerian election, the iterative best response process converges 
to the unique non-Duvergerian equilibrium from any initial Xo. In a Duvergerian election, the 
iterative best response converges to a Duvergerian equilibrium, unless X59 = r*. If Ao > X*, it 
converges to the Duvergerian equilibrium in which candidate 3 receives no support. If Xo < r*, 


it converges to zero support for candidate 2. 


Proposition 7 is best understood with reference to Figures 2.1 and 2.2. Consider first the non- 


Duvergerian case, and suppose that the initial hypothesis is that voters act truthfully. This is 
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Figure 2.1: Convergence to a Stable Non-Duvergerian Equilibrium 
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Figure 2.2: Divergence from an Unstable Non-Duvergerian Equilibrium 
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A 


equivalent to optimal voting with Ao = 0, and a best response to this is Apr = A(0). As Figure 
2.1 illustrates, the best-response process continues, and converges to the non-Duvergerian equi- 
librium A*. That the initial hypothesis is truthful voting is unnecessary in a non-Duvergerian 
election — as the figure indicates and Proposition 7 confirms, any initial non-Duvergerian hy- 
pothesis leads to the non-Duvergerian equilibrium. Figure 2.2 illustrates the Duvergerian case. 
Notice that A* < 0 when yu > 0. Beginning at A5 = 0, the reasoning process leads explosively 
away, and support for the third candidate collapses. Proposition 7 is employed as an equilibrium 
selection device, supporting the non-Duvergerian equilibrium if and only if the non-Duvergerian 


criterion is satisfied. 


A restatement of the model in terms of the public survey microfoundation is instructive. Again 
noting that 0? = €?/m,, the criterion for a non-Duvergerian election (and hence equilibrium 


outcome) becomes: 


1 2671 
S (11) 


My ‘a 


It follows that an election is Duvergerian if and only if the public signal of candidate support 
is sufficiently precise, relative to a measure of the support for candidate 1 (7) and the het- 
erogeneity of voter preferences (£). Notice that a Duvergerian outcome obtains if candidate 
1 is sufficiently strong (large 7). In addition, increased heterogeneity of preferences (large &) 
yields a non-Duvergerian classification. The non-Duvergerian equilibrium log-likelihood ratio 


becomes: 


2m! (91) eu 


iy Ee 2myP*(N)®™* (02) 
€ — 2mO-1(y) 


. 1 — Im, B-"(n)/é 


= Ei) = 
A number of the leading comparative statics from the decision-theoretic analysis are maintained. 
For 44 > 0, the non-Duvergerian case leads to strategic switching away from candidate 3. The 
incentive to vote strategically is once again increasing in the precision of public information 
(m,,) and the perceived relative strength of candidate 2 (j:). Unlike the decision-theoretic case, 
however, a decrease in € results in an increase in the strategic incentive. This stems from the 
self-reinforcing nature of strategic voting. The initial incentive \1(0) does not depend on €. 
With lower €, however, a smaller incentive is required to encourage a voter to act strategically. 
This further enhances the incentive. A final point is the continuity between the Duvergerian 


and non-Duvergerian cases. Indeed: 


g 


MX + 0o as ™u 1 55-1) 


Conditioning on accurate signal realisation, the probability that a strategic vote is observed 


may be calculated. An individual acts strategically whenever 7 + 6 + X* > 0 >n+6. With 
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fu = n, this inequality is satisfied with probability: 
2m, 87! (41) 
Pr[Vote 2U wis > ui2| min{uia, ui a] = (oe! 1 . - 
r[Vote 2U m3 > we | min{uje, wis} > uit] ( (92) ( Hem, OG) a) 


2.5 


The results of Section 2.4 make it clear that game-the 


ILLUSTRATION 


oretic considerations lead to greater 


strategic voting. How significant is this? This section provides an illustration. 


Return to the example of Part 1. The expected vote shares of the three candidates are: 


Conservative 
Labour 


Liberal Democrat 


Candidate Share 
1 


0.38 
0.32 
0.30 


Using this specification, 7, = 0.38/0.62 and y = 0.32/0.62. Assume for simplicity that the 


realised signal of the common component equals its true value, so that up = 1 = €671(y2). 


There are two parameters remaining. These are the heterogeneity of voter preferences (€) and 


the precision of the public signal (m,.). 


Varying € yields a surprisingly rich collection of payoff distributions. To illustrate this, first 


normalise ujzy = 0 and uj + ug = 1. It follows that 0 < 


wig <1. This normalisation allows a 


convenient intepretation of the payoff uj2. Suppose that voter 7 is indifferent between candidates 
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1 and 2. This happens whenever qo(u2 — wi) = 93(t43 — Ui). Applying the normalisation: 


g3 = 
g2 + 43 


Ui2 


This is the conditional pivotal probability of a LibDem-Con tie that is just sufficient to ensure 
a vote for the LibDem candidate. It follows that this is a measure of the relative strength of a 


voter’s preference for the Labour candidate. 


The distribution over u2 for various values of € is illustrated in Figure 2.3. High heterogeneity 
corresponds to € = 10. The distribution over candidate allegiance is clearly bimodal: Voters 
are either strong Labour or strong Liberal Democrat supporters, and hence require stronger 
strategic incentives in order to move away from straightforward voting. Contrast this with the 
case of € = 0.4. The distribution is unimodal, with most weight towards the centre. This 


corresponds to near indifference between the Labour and Liberal Democrat candidates. 


Figure 2.4 illustrates the probability of observing a strategic vote for the example, using the 
three values € € {0.4,1,10}. In the public information model, all switching is uni-directional, 
with voters switching from the Liberal Democrat candidate (i = 3) to the Labour candidate 
(i = 2). This means that strategic voting is bounded above by 0.3. The figure considers 
information precision in the range 0 < m, < 25. Strategic voting is clearly increasing in this 
information precision, and decreasing in heterogeneity €. Notice, however, the dramatic effect 
of moving from a decision-theoretic to a game-theoretic perspective. For instance, in the case 
of € = 1, an information precision of only m, = 1.74 yields a fully Duvergerian outcome: 
This is the critical value satisfying m, = €/26~!(7). This example suggests that, with only 
modest information precision, self-reinforcing strategic voting leads to strict bipartism. Such a 
conclusion, however, is premature. The full analysis of Part 3, including an account of private 


information sources, is required. 


2.6 CONCLUSION 


It is clear that the public information case provides an interesting benchmark. In a non- 
Duvergerian election, there is uni-directional tactical voting away from the commonly expected 
trailing candidate. Strategic voting incentives are increasing in the precision of public infor- 
mation, the lead of candidate 1 and the perceived distance between candidates 2 and 3, and 
decreasing in the idiosyncrasy of voters. With increasing public information, the equilibrium 


tends to a Duvergerian outcome, achieving this at a finite precision level m,. 


It would be appear from this analysis that the strict version of Duverger’s Law — complete 
local bipartism — might still emerge from a more general model. Indeed, the illustrations of 
Section 2.5 suggest that only a modest amount of information is required to destablise the 


non-Duvergerian equilibrium. 


Note, however, that the analysis of this part replaces only one assumption of previous models 
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Figure 2.4: Probability of a Strategic Vote from LibDem to Lab for Various m, 
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— the constituency-wide support of candidates 2 and 3 is no longer certain, but all voters hold 
common beliefs. This issue is critical, and is addressed by the analysis of the private signals 


case, to which the argument now turns. 


PART 3 


STRATEGIC VOTING WITH PRIVATE SIGNALS 


“In strategy it is important to see distant things as if they 


were close and to take a distanced view of close things.” 


Miyamoto Musashi 


3.1 INTRODUCTION 


Strategic voting does not lead to strict local bipartism. In a decision-theoretic context, this 
message is clear. Indeed, the analysis of Part 1 demonstrates convincingly that the indepen- 
dence of voter preferences is the key assumption behind the strict version of Duverger’s Law. 
Of course, the game-theoretic analysis of Part 2 appears to resurrect the bipartite outcome. 
Does it? This part argues not. With public signals as the sole information input to the deci- 
sion making process, the constituency-wide strength of candidates is uncertain. Voter beliefs, 
however, remain common. It follows that the expected leader is commonly known. This un- 
palatable feature means that constituency-level uncertainty — the unknown common effect — 
is insufficient to avoid a Duvergerian outcome. A model is desired in which voters differ in 
their opinion of which candidate is the likely leader. Indeed, this is a feature that has been 
recognised in the empirical literature. In their critique of the Niemi et al (1992) measure of 


strategic voting, Heath and Evans (1994) make the following observation: 
“The Niemi et al measure] does not allow for the possibility that some people may 
intend to avoid wasting their vote, but may be mistaken in their perceptions of the 


likely chances of the various parties winning the constituency.” 


This feature is included here, where a rather different equilibrium conclusion is reached when 


private information sources are taken into account. 
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3.1.1 Self-Attenuating Strategic Voting 


Consider once again the standard intuition behind strategic voting. Some rational voters will 
find it optimal to switch from their preferred candidate to one of the leading pair. This reduces 
the support for the trailing candidate, and enhances the incentive for others to act strategically. 
Continuing iteratively, this might (and in Part 2, often does) lead to an equilibrium in which 
all support for the trailing candidate vanishes. This informal discussion suggests that strategic 
voting is self-reinforcing. But is it? In fact, this part argues that, contrary to immediate 


intuition, strategic voting may well be self-attenuating. 


To explore this issue, voters are equipped with two different sources of information. They 
continue to commonly observe a public signal of the common component of preferences — a 
public signal of constituency-wide support for the candidates. As in Part 2, this may be likened 
to the observation of an opinion poll. Crucially, each voter also observes a private signal, based 
on the observation of other individuals in the society. Conditional on the common component, 


this is independent across the electorate. 


Part 2 considers only the pure public information case. In that case, strategic voting is indeed 
self-reinforcing, exhibiting strong positive feedback. If the public signal is sufficiently strong, 
then the fully Duvergerian equilibria are uniquely stable. The illustrative examples of Section 


2.5 suggests that the required information precision is rather small. 


With pure private information, this is no longer the case. Strategic voting by others reduces 
the incentive for a voter to act strategically. Why? Essentially, voters fear the prospect of 
strategic voting in the opposite direction to that indicated by their privately-observed signal. 
In the public information case, this issue is simply not present — all voters observe the same 
signal, and strategic voting is uni-directional. Intuition is built by considering a voter eval- 
uating the likelihood of either of two challenging candidates tying with a leading candidate. 
If voters act straightforwardly, then a tie for the lead requires one of the challengers to be 
significantly stronger than the other. A strategic voter then considers the relative likelihood 
that each of these challengers has sufficient strength throughout the constituency to force a 
tie. Suppose instead that other voters act strategically, responding strongly to their private 
signals of candidate support. Only a small asymmetry in strength between the challengers is 
necessary to force a tie. A small advantage by one is magnified by strategic voting throughout 
the electorate. Reflecting on this, a particular voter will weight the relative likelihood of each 
challenger having a small advantage over the other. This likelihood ratio is less extreme than 
in the truthful voting case, and hence there is a lower incentive for a voter to act strategically. 
It follows that strategic voting exhibits negative feedback. Indeed, as this informal discussion 
reveals, the equilibrium incentive for strategic voting in a game-theoretic environment is in 
fact less than that in a decision-theoretic environment. This rules out the possibility of stable 


Duvergerian equilibria. 


The combination of public and private information is the remaining case. The analysis shows 
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that if private information is sufficiently precise relative to public information, then the uniquely 
stable equilibrium is non-Duvergerian. Which information source is likely to dominate? Cer- 
tainly, during a national campaign for a legislative election, a large number of public information 
sources are available. Crucially, however, voting takes place at the local level. National opinion 
polls, for instance, do not reflect local effects, which remain unknown to local constituents. 
At a constituency level, therefore, one might expect public information to be relatively noisy. 
Private information dominates, and a non-Duvergerian conclusion emerges. By contrast, if the 
vote takes place at a national level — for instance, a national referendum or national leadership 
election — the presence of strong public information sources is likely to dominate. The theory 


predicts multi-party support at the district-level, and bipartism at the national level. 


3.1.2 Comparative Statics 


The comparative statics mimic those of Parts 1 and 2. The key difference is the role of informa- 
tion. Private information has a far smaller effect than public information. Indeed, beginning 
with a Duvergerian election, a non-Duvergerian outcome obtains when sufficient private infor- 
mation is added. It follows that strategic voting may sometimes be decreasing in the information 
available. Perhaps most importantly, in the pure private case, strategic voting is bounded above 


rather than below by the decision-theoretic level. 


3.1.3. Guide 


The remainder of this part completes the argument the new theory. Section 3.2 extends the 
model of Section 2.2 to include privately observed signals. Voter behaviour and the nega- 
tive feedback inherent in strategic voting is investigated in Section 3.3. Section 3.4 calculates 
the equilibria and makes a selection. The results are illustrated in Section 3.5, prior to the 


concluding remarks of Section 3.6. Omitted results are relegated to Section 3.7. 


3.2. MODEL 


3.2.1 The Election 


The specification of the election is identical to that of Parts 1 and 2. The reader is referred to 
Section 1.2 on page 5 for full details. There are three candidates j € {1,2,3} in a single-seat 
district selection, operating under the plurality rule. Realised vote totals will be denoted {z;}, 
where 5) a; = (1+ 71)n+1. The vote total for candidate 1 is fixed at x; = yn, and remaining 
voters prefer both of candidates 2 and 3 to candidate 1. Assumption 1 is upheld (y > 1/2), 


4 


ensuring that any tie for the lead involves candidate 1. All references to a “voter” and the 


“electorate” will refer to the n + 1 remaining voters. 
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3.2.2. Preference Structure 


The preference structure is identical to that adopted in Part 2. Voter i has von Neumann- 
Morgenstern utility uj; for party j, satisfying: 
eee 
log @ 4" = n+, (3.1) 
UWi3 — Usd 
Here 7 and ¢; are the common and idiosyncratic components of utility. Maintaining Assumption 


3, the idiosyncratic component is distributed independently across voters, with distribution: 


e~ N(0, €7) 


3.2.3. Public and Private Signals 


The public signal of Part 2 is augmented by a privately observed signal. Voters begin with a 
diffuse prior over 7 — they have no knowledge of average candidate support. Information on 
candidate support may then be gleaned from a variety of sources. The first is public information, 
such as the observation of a public opinion poll or similar survey. Assumption 4 is retained, 
so that voters commonly observe . ~ N(n,o7). Following observation of this signal, and 


prior to the receipt of any private information, voters update to a common public posterior 
n~ N(p, a”). 


Viewed as an opinion poll, Assumption 4 provides a natural framework. In particular, the 
widespread publication of opinion polls is common during an election. For large m,, this leads 
to a high precision of public information. Notice, however, that opinion polls typically occur at 
a national level. They reflect nationwide support for political parties, but rarely give accurate 
constituency level information. This feature was present in the 1997 British General Election. 
Evans, Curtice and Norris (1998) note that 47 nationwide opinion polls were conducted during 
the election campaign. By contrast, only 29 polls were conducted in 26 different constituencies 
at a constituency level, out of a total of 659 constituencies. To the see the effect of this, 
decompose the common effect 7) into national and district components, so that 7 = 7) + (7 — 7) 
where 77 is the national component. Even with perfect knowledge of 77, uncertainty over the 
district component remains, leading to a public posterior belief of 7 ~ N(j, var[n —7]). At the 
constituency level, the absence of opinion poll information for the district lowers the precision 


of public information. 


Even in the presence of a constituency level opinion poll, it is perhaps less likely for the poll 
results to be publicly observed by all members of a constituency. Publicity over such results 
is often provided by political parties themselves, and these are often manipulated to suit the 


1 


interests of the candidates.’ It is thus reasonable to expect the level of information at a 


'The 1997 British General Election provides an example. In the Oxford West and Abingdon constituency, 
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constituency level to be rather low. Voters do, however, have other sources of information 
available. In particular, a signal of constituency-wide candidate support may be obtained from 
the people with whom an individual interacts. The important characteristic of such information 


is that it leads to private signals. 


Assumption 5. Each voter i observes a private signal 6; ~ N(n,k7). Conditional on n, this 


is independent across voters but may be correlated with the idiosyncratic component €;. 


Once again, a microfoundation is available. The private signal 6; corresponds to the observation 
of mg randomly chosen individuals, with k? = €?/mg. In particular, a voter’s own payoffs are a 
signal of 7. Hence with ms = 1, it follows that «2 = €?. More generally, with this microfounda- 
tion, the variance of private signals is bounded above, with x? < €?. The inclusion of a voter’s 
own preferences in the signal results in correlation between the signal and idiosyncratic utility 
component ¢;. For instance, in a sample of ms > 1: 
1 ae 
6; =n+— at > « => cov(6;,6) = 2 =K? 
m6 Pwr m6 


This feature is incorporated into the analysis, and extends easily to further correlation between 
the preferences of voter 7 and sampled individuals. Defining the correlation coefficient between 


6; and €; as p, the microfoundation presenteded here yields: 


P25 0 


mls 


Following the observation of 6;, a voter updates to obtain a private posterior belief. 


Lemma 6. The posterior belief of voter i over 1 satisfies: 


N K+ ob; Ka? 
q er aa ee 


Proof: Apply the standard Bayesian updating procedure — see DeGroot (1970). O 


The specification of private signals implicitly assumes that sampled individuals reveal their pref- 
erences truthfully. Furthermore, since the unconditional probability of influencing the election 
outcome is vanishingly small, it seems unlikely that individuals would find a costly information 
acquisition exercise to be worthwhile. The argument presented here accepts this latter critique. 
If a voter finds it too costly to conduct a private opinion poll, then the strategic manipulation 
of voting intentions by sampled individuals is no longer of relevance. The question of the pri- 


vate information source remains. It is envisioned that his information is accumulated over an 


the “state of play” of the various candidates was publicised by the political parties themselves. The Labour 
and Liberal Democratic candidates both argued that they were best-placed to take the various seats. Indeed, 
the author received a variety of conflicting leaflets during the campaign, and eventually decided to ignore their 
content. 
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extended period of time prior to an election, in the course of daily activity. It seems unlikely 


that a sampled individual would find response manipulation worthwhile over such a time frame. 


3.3. VOTER BEHAVIOUR 


3.3.1 Optimal Voting 


Once again, optimal voting is determined by pivotal considerations. With private information, 
the perceived pivotal log-likelihood ratio may vary throughout the electorate. Recall Definition 


2, and introduce the subscript 2 so that: 


Vote2 & iby 

WiZ — Vil 

The focus will be on symmetric strategy profiles, and hence symmetric belief rules. It follows 

that the perceived pivotal log-likelihood ratio depends only on a received private signal 6;, and 

not on the voter’s identity 7. Thus A; = (6;). It follows that a voter supports candidate 2 
whenever 7 + & + A(6;) > 0. 


3.3.2 Affine Belief Rules under Best Response 


Begin by restricting to optimal voting given beliefs. A strategy profile is then entirely deter- 
mined by the belief rule \(6;). Are there any particular belief rules that are likely to arise in 


equilibrium? The analysis will reveal the importance of affine belief rules. 


Definition 5. An affine belief rule satisfies \; = a + b6;. 


The parameters a and b may depend on the realisation of the public signal ~ as well as the 
parameters of the model. Notice that truthful voting corresponds to an affine belief rule with 
a=b=0. Furthermore, Duvergerian equilibria will always be present, obtained by setting 
b = 0 and extending the real line to set a = too. Focus on affine belief rules is justified by the 


following lemmata. 


Lemma 7. For an unboundedly large electorate (n — co) any smoothly increasing belief rule 
A(6;) maps to an affine belief rule under best response: When all adopt X(6;), a best response 
is to adopt an affine belief rule (6;) = a+ b6; for some @ and b. 


Proof. See Section 3.7. O 


Lemma 8. The class of affine belief rules is closed under best response. If all adopt a belief 
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rule (6;) =a+6;, then a best response is to adopt an affine belief rule \(6;) = +6; where: 


bla(K? +07) + (1+ b)K74] 
o7(1+b) 

2kO-1 (4) 

K2(1 +b) 

= varie, + b(6; — 7) | n] 


Proof. The first part of this lemma is a simple corollary of Lemma 7. The generation of the 


explict formulae above mimics the proof of Proposition 4, and is described in Section 3.7. O 


Following Lemmata 7 and 8, attention will now turn to the properties of affine belief rules. 
Imposing best response, any rules will remain in this class. First consider the private component 


b, which describes the response of a voter to the private signal. 


Lemma 9. The mapping b(b) is strictly decreasing in b, with: 


6(0) = ae and lim 6(b) = ae) 


boo K 


Proof. See Section 3.7 O 


A decision-theoretic conclusion is immediate. Straightforward voting is equivalent to optimal 
voting with a = b = 0. A decision-theoretic voter calculates the optimal voting rule, given 


straightforward voting by all others. This yields: 


_ OW and bpp — BPM) 


apT 
o2 K2 


From this, the pivotal log-likelihood ratio becomes: 


= =A be bi 
Apr (bi) = 28" (1) (4 + =| 
This mirrors the decision-theoretic analysis of Part 2. Attention now turns to the presence of 


negative feedback with private information. 


3.3.3 Self-Attenuating Strategic Voting 


Using Lemma 8, notice that b(b) is decreasing in b. An increase in the response to signals by the 
electorate at large reduces the weight placed on the signal by the typical voter: An increase in 
the tendency to vote strategically by others reduces the incentive for a voter to act strategically. 
Strategic voting with pure private information is not self-reinforcing, in stark contrast to the 


public information case. 
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Figure 3.1: Self-Reinforcing Strategic Voting with Public Signals 
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Figure 3.2: Self-Attenuating Strategic Voting with Private Signals 
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The intuition is aided by Figures 3.1 and 3.2. Consider first the public case. Following obser- 
vation of the public signal jz, a typical voter holds a common public posterior belief over the 
common utility component 7. This is illustrated in Figure 3.1. A randomly selected individual 
votes for candidate 2 whenever 7+ A+; > 0. For a tie with candidate 1, this requires a critical 
value of 7 = €6—1(7,) — A. Similarly, a tie between candidates 1 and 3 requires a critical value 
of n = €6-!(1—+,) —X. Increasing \ reduces both of these values and, for jz > 0, increases the 


likelihood ratio of 1-2 tie versus a 1-3 tie. It follows that strategic voting is self-reinforcing. 


By contrast, the private case is illustrated in Figure 3.2. First consider the case of a=b=0. A 
tie between candidates 1 and 2 requires a critical value of 7 = €6—1(y,). For b > 0, a randomly 
selected individual votes for candidate 2 whenever 7 + 66; + €; > 0, requiring a critical value 
of n = k&—-'(y)/(1 +b). This is decreasing in b. Moving to a 1-3 tie, the critical value of 
n = €®-!(1 — 1) increases to 7 = KO-1(1 — y)/(1 +6). This follows, since 6-'(1— 4) < 0. 


The critical values are then closer, so that the likelihood ratio moves closer to 1. 


This may at first seem counter-intuitive. When 0 is high, voters respond strongly to their 
signal. In particular, this increases the likelihood of a tactical vote. Importantly, it increases 
the probability of a strategic vote in both directions. It may seem that a voter receiving a signal 
6; > 0 may view the prospect of a 1-3 tie as extremely unlikely. For high 6;, however, a win by 
candidate 2 seems almost certain. To affect the outcome, however, a voter conditions on the 
event of a tie. For high 6;, a tie can only occur if the voter’s signal has overstated the support 


of candidate 2. It is then reasonable for the voter to consider true values of 7 satisfying 7 < 0. 


Summarising, voters become worried that there may be tactical switching in the opposite di- 


rection to that indicated by their private signal. 


3.4 EQUILIBRIUM 


3.4.1 Equilibrium Voting 


Once again, there are always Duvergerian equilibria present. Given that all n+1 voters support 
candidate j € {2,3}, the pivotal log-likelihood ratio is undefined, and there is no incentive for 
voters to switch. A non-Duvergerian equilibrium may be available. Based on the results of the 
previous section, it is sufficient to examine the class of affine belief rules. A non-Duvergerian 


equilibrium then corresponds to a finite pair {a*,b*} such that b* = b(b*) and a* = a(a*, b*). 
The properties of the the mapping b(b) immediately yield a unique fixed point. 


Lemma 10. The mapping b(b) has a unique fixed point b* . For p> /€, this satisfies: 


gE 0D <br < ieee) {1+ ee “} (3.2) 
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For the microfoundation case, with p = k/€, the bound may be refined to: 


26-1(y,) 4, . 1m) (G-1(1))? + & 
Ee Se p-n/7Gree 


Proof. See Section 3.7. O 


Any non-Duvergerian equilibrium must entail b = 6*. It remains to consider fixed points of 4. 


Notice that @ is affine in a, yielding a unique non-Duvergerian equilibrium. 
Proposition 8. For a? 4 b*«? There is a unique affine non-Duvergerian equilibrium: 


eet 
ac ey Ta 


Proof. Straightforward solution to a* = G(a*, b*). O 


Once again, there are multiple equilibria. Moreover, reprising the analysis of Part 2, the 
direction of the response to the public signal jz is ambiguous, since the denominator of a* can 


be either positive or negative. 


3.4.2 Equilibrium Selection 


An equilibrium selection problem is posed. In the private signal case, there is a unique non- 
Duvergerian equilibrium, together with two fully Duvergerian equilibria. Stability is once again 
employed as an equilibrium selection device. Consider an iterative best response process. As 
in Section 2.4, it is natural to specify an initial hypothesis of straightforward voting. This is 
trivially an affine belief rule, as it corresponds to a9 = bb = 0. Lemma 8 ensures that the 


best-response process remains within the class of affine belief rules. 


Definition 6. Define the iterative best response process by bk = b(b,-1), at = Q(at-1, 4-1). 


The mapping b(b) and associated process {h;} is not contingent on a;, and hence may be 


considered in isolation. 


Lemma 11. 0* is globally stable in the iterative best response dynamic: b, — b* as t — oo. 


Proof. See Section 3.7. O 


Whereas the formal proof of Lemma 11 is algebraically tedious, a diagrammatic illustration 
proves useful. Figure 3.3 plots the best response function b(b), illustrating the convergence to 
the fixed point. Notice the cyclic behaviour — this is a consequence of the negative feedback 


inherent in strategic voting with private signals. Begin with bj) = 0. There is a strong response 
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Figure 3.3: b(b) and Convergence to Equilibrium 


to the private signal — this is the decision theoretic case, yielding b; = bpr. Taking the next 
step, the focal voter recognises the strategic behaviour of others. This attenuates the response 
to the private signal, with a consequent reduction in b. Of course, this behaviour leaves open 
the possibility of a limit cycle in the iterative best response process. Lemma 11 ensures that 


the cycling dampens down, eventually converging to the unique fixed point b*. 


To select an equilibrium, now turn to the mapping (a, b*). 


Proposition 9. If 0? > b*k?, then a* is uniquely stable. A sufficient condition is for: 


o? > K-19) ( ie OU e2e a 


Dah) 


If this holds, then the non-Duvergerian equilibrium is uniquely stable, and attained as the limit 


of the iterative best response process from any finite starting point. 


Proof. This follows from the affine nature of the mapping a. The sufficient condition is obtained 


by employing the upper bound on b* from Lemma 10. O 


Corollary 1. With pure private information (c2 — co), the non-Duvergerian equilibrium is 


uniquely stable, with a* = 0. 
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It follows that the non-Duvergerian equilibrium is selected whenever the public information 
source is sufficiently imprecise. This requirement is also present in the public information 
model of Part 2. Notice, however, that the precision is judged relative to the precision of the 
private signal.? Hence if private signals are relatively more important than public signals, a 
non-Duvergerian outcome emerges. As Corollary 1 confirms, with only private information, 
the equilibrium is always non-Duvergerian. Which situation is likely to obtain? In a national 
referendum or similarly nationally conducted election, there are typically many public informa- 
tion sources. Moreover, region-specific effects are unimportant. It follows that public sources 
are likely to be more important than private. At a district level, however, commonly observed 
public signals are likely to be fewer. It follows that private signals are the primary source of 


information. A non-Duvergerian conclusion emerges. 


3.4.3. Comparative Statics 


The comparative statics for the private information model mimic those of both the decision- 
theoretic (Part 1) and public information (Part 2) models. In particular, the incentive to 
vote strategically is increasing in the strength of candidate 1 (y,) and the relative strength of 
candidate 2 over candidate 3 (72). To see this, specialise to the case of pure private information, 
so that a* = 0 and \(6;) = b*6;. Taking expectations, E[\;] = b*€®~1 (72). Examining the best 


response function b(b) : 


a 2B-1 (91) /E?2 + 2K? + 2oKEd 
7 k2(1 +) 


The best response (and hence the fixed point b*) is increasing in 71. 


There key differences between this and the previous cases. The first is that strategic voting is 
bi-directional; depending on the outcome of the privately observed signal 6;, a voter may switch 


in either direction. Indeed, given 7 > 0, an “incorrect” signal occurs with probability: 


Pr[é, <0] =Pr[2— 1 < —4| = 0 (2) =1-0 (2) 


K K K 


The second key difference is the effect of information precision on the strategic incentive. To 
investigate this, specialise to the microfoundation case of p = k/£€, and employ the upper bound 


on b* from Lemma 10: 


BIA] = SE) < SIP) 


?In fact, the condition presented here is of the same form as that in Morris and Shin (1999). They consider 
the coordination problem of bank runs, and find that the ratio of the variance of a public signal and standard 
deviation of a private signal determines the uniqueness of equilibrium. 
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Figure 3.4: Probability of a Switch from 3 to 2 (Private Information, € = 0.4) 


Next, recall that €/K = ms. Furthermore, allow € | 0 to obtain: 


EL] © 2871 (92) 87" (1) ng 


There are two pertinent observations. First, as € | 0, the expected incentive approaches the 
lower bound given in Lemma 10. Second, notice the effect of ms. In particular, compare this 
with the decision-theoretic analysis of Part 2 — in particular, Equation (2.5, p. 29). In the 
decision theoretic case, the strategic incentive is increasing in the precision m. In the private 
case, however, the strategic incentive is increasing in \/m. It follows that there is less strategic 
voting than in a decision-theoretic world, and furthermore that the impact of increased signal 


precision is much less. 


3.5 ILLUSTRATION 


A last visit to the example of Part 1 proves useful in illustrating the comparative statics de- 
scribed in Section 3.4. Figure 3.4 illustrates the probability of a strategic switch from candidate 
3 to candidate 2. The specification is one of pure private information for € = 0.4 — it follows 
that voters are relatively homogeneous, and hence one might expect extensive strategic voting. 
This is not the case. Notice that strategic voting is significantly below the decision-theoretic 
benchmark. Negative feedback is the cause. Furthermore, strategic voting from candidate 3 
to candidate 2 overstates its effect — there may be individuals who switch in the opposite 
direction, offsetting the effect. Based on this example, and specifying reasonable precisions 
for privately observed signal precisions, strategic voting is far from complete — strict local 


bipartism is rejected. 
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3.6 CONCLUSION 


The new theory of strategic voting observes that the Cox (1994) and Palfrey (1989) models of 
strategic voting are driven by the assumption that voter preferences are drawn independently 
from a commonly-known distribution. This leads to the divergence of the pivotal log-likelihood 
ratios that are the critical determinants of optimal voting behaviour. Introducing uncertain 
common effects to voter preferences results in uncertain constituency-wide candidate support, 
and overturns this result — pivotal log-likelihood ratios remain finite. Moreover, it is only 
uncertainty over common effects that matters. This suggests that the Cox-Palfrey model is 


perhaps driven by the wrong factors. 


The introduction of uncertain common effects, and the careful modelling of voter information 
sources leads to an elegant model. The key réle of information is now clear — a fully Duvergerian 
outcome requires both precise public information and the absence of precise private information. 
The preferred model is the pure private information model of this part. As Section 3.2 argues, 
information at a district level is more likely to be private. The model predicts some, but not 


complete, strategic voting. 


Earlier models lack comparative static predictions. This is a necessary consequence of the strict 
bipartite prediction. Here, the comparative statics are clear. In particular, the model predicts 
less strategic voting in a close election. Strategic voting is also increasing in the homogeneity of 
the electorate. Evans, Curtice and Norris (1998) hypothesise that increased strategic voting in 
the 1997 British General Election was likely due to increased indifference between the Labour 


and Liberal Democrat challengers. This view is consistent with the new theory. 


Weaknesses remain. The model considered here specialises to the case of a three-candidate 
election. Of course, the three-candidate model is of interest in its own right — there are 
three main challengers in the majority of UK constituencies. More restrictive, however, is the 
assumption that the votes of candidate 1 are fixed. The introduction of uncertainty over x1 
may further reduce the strategic incentives in the model. To see this, notice that the absolute 
probability of a pivotal outcome is larger as the strength of candidate 1 vanishes. This will 
likely reduce the pivotal likelihood ratio of ties between the possible tying pairs. Perhaps more 
serious is the restriction on voter preference orderings. The structure of preferences rules out 
multi-direction strategic switching; all n voters are assumed to support either candidate 2 or 
candidate 3. In the case of 7 > 0, this ensures that candidate 2 is in fact a Condorcet winner. 


This restriction requires lifting. 


All of these issues are the subject of ongoing research. In addition, empirical testing of the model 
is already in progress, and experimental work is planned. Hopefully a greater understanding of 
strategic behaviour may lead to a better understanding of electoral systems. Alternatively, the 
reader may wish to adopt the sentiments of Winston Churchill, who claimed that “[t]he best 
argument against democracy is a five minute conversation with the average voter.” Perhaps a 


conversation with the pivotal voter would have confirmed his views. 
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3.7 OMITTED RESULTS 


3.7.1 Properties of b(b) 


This section analyses the properties of the best response function b(b) described in Section 3.3. 


Proof of Lemma 7: Consider an arbitrary smoothly increasing belief rule \(6;). Define: 
p= Pr[A(6i;) +7 + & 2 0|n| = A(n) 


This is the probability that a randomly selected voter supports candidate 2, given the common 
utility component 7. Given that A(6;) is smoothly increasing, H(7) is strictly increasing in 1. 
Write h(n) = H'(n). It follows that: 


F(p) =Pr[y < H7l(p)| = (ee) 


var |] 


This probability (via E(7) and var[n]) is conditional on the information available to a focal 


voter, and uses the fact that posterior beliefs over 7 are normal. Differentiate to obtain: 


ep eve (56) 
f(p) = h(H—1(p)) am var [7] 


Next, using Proposition 1 from Part 1, recall that: 


axe WO A) 
Be a fd-n) 


Evaluate this expression and take logs to obtain: 


er Gi) eee h(H*(1—)) — G7'(m) — Elm)? rr (H7*(1—) — Elm)? 
"Fa-n) A “G 1) Qvar[n) a 

= h(H7*(1-—1)) 7d-)?-Al (mn)? , Aon) “"(1-) 

ae h(H-1(91)) 2var{n| x = ee) 


Notice that this is affine in E[n]. Next recall from Lemma 6 of Section 3.2 that: 


N K7u+076; Ko? 
q oe tees 


Thus var[7| does not depend on 6;, and E[7] is affine in 6;, so that (6;) is affine in 6;. O 


Proof of Lemma 8: Suppose that each voter adopts the initial affine belief rule. This results in 


a voter for candidate 2 whenever: 


ntra+bé+6>20 & a+(1+b)n = —E — 5(6; -—7n) 
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Conditional on 7, the right hand side is normally distributed with zero expectation, and variance 
kK? = varle; + b(6; — )]. It follows that: 


a K®-!(p) —a 
p=H(n) =® (ao wy = n=H-y) = we i eh 
Differentiate to obtain: 
h(n) = HY(n) = Zo (SEAM) 5 pert) = Fo 1) 


Begin with the first term of Equation (3.3). First employ the symmetry of the normal distri- 
bution to note that -!(1 — 4) = —@7!(y), and that $(z) = ¢(—z). It follows that: 


ee (O70 =)) _ 
h(H~!(1)) o(®-"(91)) 


Next consider the second term of Equation (3.3). 


(R®7"(q1) — a)? [ROT* (1) P +a? — 2aKO7" (1) 
(1+)? 7 (1+)? 


H'(n)? = 


Similarly: 


KO-1(1 — y)]? + a? — 2akS-1(1 — RO" (1)? + a? + 2akKO=! 
H(1—y)? = Jee = eee 


It follows that: 


H'(-n)?-AT (nm)? _ 2akO7* (m1) 


2var|n]| ~ -var[n|(1 + 6)? 


The final term is simply: 


_ 2RO- (m1) +8) 


C+ ovary 


Assembling, obtain: 


f(m)_ _ 2k®7*(m)(a+ (1 + 5) Elm) 
f(l-™) var[n](1 + 6)? 


The next step is the evaluation of the expectation and variance of 7, conditional on the infor- 
mation of a typical voter. Once again, recalling Lemma 6 of Section 3.2 it follows that: 
Ku t+ 076; Keo 


E[n| = ge and var|1] = ey rea) 
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Substitute in to obtain: 


fly) 2ROT"(q1) (a(n? +07) + (1 + d)(K?2 + 076;)) 
fd-m) K207(1 +b)? 


Separate this out to obtain the affine function: 


2B" (qr )la(n? +07) +L +b)Kep) | 2791) 6i 


\= 
k207(1 + b)? K?(1 +b) 
Taking the intercept: 
QO "(ni )la(n* +07) + (1+ b)K*p) QHD" (Jo? [a(k? +07) + 1 +b) KPH 
k20?(1 + b)? ~  K2q2(1 40) o7(1+b) 
_ bla(n? +0?) + (1 +b)K7 4] 
os o7(1+b) 
This yields the desired result. | 


Proof of Lemma 9: From Lemma 8 the best response b(b) satisfies: 


i, = 2kO~! (1) 
~ K2(1 +5) 
This requires evaluation of &, which satisfies: 


kK? = varle; + b(6; — 9) |] = €2 + bn? + 2pnEb 


where p is the correlation coefficient between ¢; and 6;, conditional on the common utility 


component 7. Hence: 


2b-! (4) /E2 + B22 + Boned 


b= 
kK 1+) 
Evaluating the derivative: 
10) = 267! (71) bx? + pr€ Je? + PR? + QpKEb 
Ke? (1 + b)\/€2 + BK? + Qoned (Lao 
20-1 (41) /E2 + BK? + 2oKEb bw? + pw& 1 
K?(1 +b) €2+b?K2 + 20K€b 14+b 
A bK? 1 
= Fi K~ + pK& _ 
2+ b?K2 +20K€b 14+b 


This is decreasing for b > 0 if: 


bw? + pw& Z 1 
€2 + BK? + 2oKEb~ 14+b 
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Re-arrange this expression to obtain €(px — €) < bK(p€ — k) To check this inequality, first 
consider the right hand side. First p > «/€ by assumption — see Section 3.2. Since b > 0, 
it is sufficient to show that the left hand side is weakly negative, which requires € > pk. But 
this holds, since 0 < p < 1 and k« < €. It follows that the function is (weakly) decreasing 


everywhere. Next, evaluate at the extremes to obtain: 


- 2g@-! - 20-1 
6(0) = 28S Mh) ane 
K boo K 
These calculations yield the desired properties of the function. | 


Proof of Lemma 10: To obtain an upper bound for the fixed point b*, write b(b) as: 


a 20-1 24 $2K2 49 b 
b(b)K ~~ (m1) VE oP OTK se pgk 
K+ 0K 


Make the change of variable @ = Kb to obtain: 


4(3) — 22) VE? + P+ 2088 _ 20 Nn) (E+ 8) 
(GB) = K+ = B 


An upper bound may now be obtained by solving the equation: 


B —2671(y1)(€+ 8) =0 


This has a positive root at: 


a= oem) os 


Oty) 


It follows that an upper bound for the fixed point is: 


22H) O-1(y1) + 2€ 
b a, : {1+ Ey) 


This upper bound was obtained by setting p = 1. A tighter bound is available via a formal 
implementation of the microfoundation for the privately observed signal. In that case, the 


correlation coefficient satisfied p = «/€. The bound on the transformed equation becomes: 


Ke -1 / ¢2 2 2 D) 


It is clear that the right hand side is decreasing in &. Hence sending « | 0: 


8(8) < ey Gee? 
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To obtain an upper bound, solve the equation: 


B* — (2671(m))?(C? + 8”) =0 


This equation is quadratic in 67, and may be solved to obtain the positive root: 


GF (287* (m1)? + V2O*(m))4 + 408) PO 


Moreover, it is clear that this bound is attained as « | 0. 


3.7.2 Stability of b(b) 


Proof of Lemma 11: Consider the mapping: 


Notice that 6 is also a fixed point of B. Taking the derivative of this function: 


A A 


B'(b) = U(b())') 
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It follows that this is an increasing function, since b! < 0. Consider a generic fixed point }, 


satisfying B(b) = b. Evaluate the derivative at this fixed point: 


bn? 1 : bK2 
B'(b) | arm a ts - a} 8 ee BROS 


6246242 4+20K€b 1+6 


2+ b2k2+2pK€b = 1+b 


bn? + pr&b b . { b?K? + pK&b b 
6246242 4+20K€b 1+6 


€24.62K2 +20K€b  1+6 


It is clear that, for p > 0, both of these terms are less than one, and hence B’(b) < 1 at a 


fixed point. It follows that any fixed point must be a downcrossing. Further fixed points would 


require an upcrossing, and hence there is a unique fixed point b*. From this it follows that 


b; — b*. To see this, notice that bi+2 = B(b;). From the properties of B, there is the required 


convergence. 


O 
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